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Preface 


PREFACE TO SECOND EDITION 


Much has (or has not) happened in the seven years or so since the publication of the first 
edition of this book. Most notably, a new spin-0 particle with mass ~125 GeV was discovered 
at the LHC by the ATLAS and CMS collaborations, which is either the elementary Higgs 
boson or something that closely resembles it. The Higgs discovery completes the = 40 
year saga of verifying the standard model (SM). Moreover, its mass is almost maximally 
interesting: it is near the top of the range predicted by the most popular SM extension, 
minimal supersymmetry, and near the minimum value consistent with the unextended SM 
(and then only if the vacuum is metastable)! 

However, the notorious problems of the SM are still unresolved. Perhaps the most press- 
ing is the apparently fine-tuned hierarchy between the weak interaction and gravity scales. 
Extensive searches at the LHC and elsewhere have so far not yielded any compelling evidence 
for new TeV-scale physics such as supersymmetry, extra space dimensions, or strong cou- 
pling that had been proposed to explain or at least stabilize the hierarchy. Equally puzzling 
is the nature of the dark energy and its incredibly tiny magnitude compared to most theo- 
retical expectations. Similarly, numerous experimental attempts to identify the mysterious 
dark matter inferred from its gravitational effects have not as yet had any positive results 
and have excluded much of the parameter space for supersymmetric dark matter. And de- 
spite the great experimental success of the SM, it is a very complicated theory, involving 
several interactions with different properties, and two apparently superfluous heavier copies 
of the fundamental particles that constitute ordinary matter under ordinary conditions. For 
these and other reasons, many theorists have started exploring less canonical possibilities, 
such as a dark matter sector that is at most very weakly coupled to ordinary particles, 
or, more radically, that the Universe is part of a vast multiverse of regions (presumably 
associated with a superstring landscape of vacua) with different laws of physics. 

The existing experimental programs in high energy physics will continue for many years. 
These include high luminosity running at the LHC; active programs around the world in 
neutrino, flavor, and dark matter physics; and observational probes of the dark energy. 
There are also proposed next generation facilities such as new ete” colliders that can serve 
as Higgs factories, and O(100 TeV) hadron colliders. We will most likely find evidence 
for any multi-TeV scale physics relevant to the hierarchy problem or that is “just there” 
as a remnant of a more basic underlying theory; hopefully identify the dark matter and 
energy and shed light on the origin of the baryon asymmetry; perhaps progress toward a 
fundamental grand unification, superstring, or other theory that no one has yet imagined; 
and even reconsider such paradigms as naturalness, uniqueness, and minimality.! 

Like the first edition, this volume is intended to serve as a detailed text and refer- 
ence on the formalism, technology, phenomenology, and experimental verification of the 
standard model and its possible extensions. In addition to updating all of the experimen- 
tal and phenomenological results, it contains expanded discussions of collider, Higgs, neu- 
trino, and dark matter physics, and includes many new problems. The book website at 


lFor more extensive speculations along these lines, see, e.g., (Langacker, 2017). 
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www.sas.upenn.edu/~pgl1/SMB2/ includes various supplemental materials, suggestions for 
use in a one-semester course, and corrections. 

I would like to thank Vernon Barger and Jonathan Heckman for critiquing parts of this 
new edition, all those who have commented on the first one, and Irmgard for her extreme 
patience during the preparation of this new version. 


Paul Langacker 
December 8, 2016 


PREFACE TO FIRST EDITION 


In the last few decades there has been a tremendous advance in our understanding of the 
elementary particles and their interactions. We now have a mathematically consistent theory 
of the strong, electromagnetic, and weak interactions—the standard model—most aspects of 
which have been successfully tested in detail at colliders, accelerators, and non-accelerator 
experiments. It also provides a successful framework and has been strongly constrained by 
many observations in cosmology and astrophysics. The standard model is almost certainly 
an approximately correct description of Nature down to a distance scale 1/1000th the size 
of the atomic nucleus. 

However, nobody believes that the standard model is the ultimate theory: it is too 
complicated and arbitrary, does not provide an understanding of the patterns of fermion 
masses and mixings, does not incorporate quantum gravity, and it involves several severe 
fine-tunings. Furthermore, the origins of electroweak symmetry breaking, whether by the 
Higgs mechanism or something else, are uncertain. The recent discovery of non-zero neutrino 
mass can be incorporated, but in more than one way, with different implications for physics 
at very short distance scales. Finally, the observations of dark matter and energy suggest 
new particle physics beyond the standard model. 

Most current activity is directed toward discovering the new physics which must underlie 
the standard model. Much of the theoretical effort involves constructing models of possible 
new physics at the TeV scale, such as supersymmetry or alternative models of spontaneous 
symmetry breaking. Others are examining the extremely promising ideas of superstring 
theory, which offer the hope of an ultimate unification of all interactions including gravity. 
There is a lively debate about the implications of a landscape of possible string vacua, 
and serious efforts are being made to explore the consequences of string theory for the 
TeV scale. It is likely that a combination of such bottom-up and top-down ideas will be 
necessary for progress. In any case, new experimental data are urgently needed. At the 
time of this writing, the particle physics community is eagerly awaiting the results of the 
Large Hadron Collider (LHC) and is optimistic about a possible future International Linear 
Collider. Future experiments to elucidate the properties of neutrinos and to explore aspects 
of flavor, and more detailed probes of the dark energy and dark matter, are also anticipated. 

The purpose of this volume is to provide an advanced introduction to the physics and 
formalism of the standard model and other non-abelian gauge theories, and thus to provide 
a thorough background for topics such as supersymmetry, string theory, extra dimensions, 
dynamical symmetry breaking, and cosmology. It is intended to provide the tools for a 
researcher to understand the structure and phenomenological consequences of the standard 
model, construct extensions, and to carry out calculations at tree level. Some “old-fashioned” 
topics which may still be useful are included. This is not a text on field theory, and does not 
substitute for the excellent texts that already exist. Ideally, the reader will have completed 
a standard field theory course. Nevertheless, Chapter 2 of this book presents a largely 
self-contained treatment of the complicated technology needed for tree-level calculations 
involving spin-0, spin-3, and spin-1 particles, and should be useful for those who have not 
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studied field theory recently, or whose exposure has been more formal than calculational.? 
It does not attempt to deal systematically with the subtleties of renormalization, gauge 
issues, or higher-order corrections. An introductory-level background in the ideas of particle 
physics is assumed, with occasional reference to topics such as gluons or supersymmetry 
before they are formally introduced. Similarly, occasional reference is made to applications 
to and constraints from astrophysics and cosmology. The necessary background material 
may be found in the sources listed in the bibliography. 

Chapter 1 is a short summary of notations and conventions and of some basic mathe- 
matical machinery. Chapter 2 contains a review of calculational techniques in field theory 
and the status of quantum electrodynamics. Chapters 3 and 4 are concerned with global 
and local symmetries and the construction of non-abelian gauge theories. Chapter 5 ex- 
amines the strong interactions and the structure and tests of Quantum Chromodynamics 
(QCD). Chapters? 6 and 7 examine the electroweak interactions and theory, including neu- 
trino masses. Chapter 8 considers the motivations for extending the standard model, and 
examines supersymmetry, extended gauge groups, and grand unification. There are short 
appendices on additional topics. The bibliographies list many useful reference books, re- 
view articles, research papers, and websites. No attempt has been made to list all relevant 
original articles, with preference given instead to later articles and books that can be used 
to track down the original ones. Supplementary materials and corrections are available at 
http://www.sns.ias.edu/~pg1/SMB/. Comments, corrections, and typographical errors 
can also be sent through that site. 

I would like to thank Mirjam Cvetic, Jens Erler, Hye-Sung Lee, Gil Paz, Liantao Wang, 
and Itay Yavin for reading and commenting on parts of the manuscript, Lisa Fleischer for 
help in the preparation of the manuscript, and my wife Irmgard for her extreme patience 
during the writing. 


Paul Langacker 
July 4, 2009 


?Most calculations, especially at the tree-level, are now carried out by specialized computer programs, 
many of which are included in the list of websites, but it is still important to understand the techniques 
that go into them. Some examples may be found in the notebooks on the book website. 

3These chapter numbers refer to the first edition. 
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CHAPTER 1 


Notation and Conventions 
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In this chapter we briefly survey our notation and conventions. 


Conventions 


We generally follow the conventions used in (Langacker, 1981). In particular, (u,v, p, o) 
are Lorentz indices; (i,j,k = 1---3) are three-vector indices; (i,j,k = 1---N) are also 
used to label group generators or elements of the adjoint representation; (a,b,c) run over 
the elements of a representation, while (a, 6,7) and (r,s,t) refer to the special cases of 
color and flavor, respectively. (a, 8) are also occasionally used for Dirac indices. (m,n) are 
used as horizontal (family) indices, labeling repeated fermions, scalars, and representations. 
The summation convention applies to all repeated indices except where indicated. Opera- 
tors are represented by capital letters (T’,Q,Y), their eigenvalues by the same symbols or 
by lower case letters! (t',¢,y), and their matrix representations by (L', Lg, Ly). In Feyn- 
man diagrams, ordinary fermions are represented by solid lines; spin-0 particles by dashed 
lines; gluons by curly lines; other gauge bosons by wavy lines; and gluinos, neutralinos, 
and charginos by double lines. Experimental errors are usually quoted as a single num- 
ber, with statistical, systematic, and theoretical uncertainties combined in quadrature and 
asymmetric errors symmetrized. 


Units and Physical Constants 


We take A = c = 1, implying that E, p, m + have “energy units,” such as electron 


volts (eV).? Related energy units are 


> m? 


1 eV = 10° meV = 107? keV = 107° MeV = 107°? GeV 


1.1 
= 107}? TeV = 1071 PeV = 107!’ EeV, (11) 


where the prefixes represent milli, kilo, Mega, Giga, Tera, Peta, and Exa, respectively. One 
can restore conventional units at the end of a calculation using the values of h,c, and hc 
listed in Table 1.1. We use Heaviside-Lorentz units, in which the fine structure constant is 
a = e? /4r, where e > 0 is the charge of the positron. 


lOr sometimes er for the electric charge of the rth quark. 
2Most likely only dimensionless quantities, such as a or ratios of masses, are fundamental. 


2 The Standard Model and Beyond 


TABLE 1.1 Conversions and physical constants. 


ħ ~ 6.6 x 1077? MeV-s c~ 3.0 x 101? cm/s he ~ 197 MeV-fm 


aq! ~ 137.04 a—!(MZ) ~ 128.9 sin? 6y(M2) ~ 0.2313 

ag( M2) ~ 0.034 agı (M2) ~ 0.010 as(M2) ~ 0.118 

Gpr ~ 1.17 x 1075 GeV™? Mw ~ 80.39 GeV Mz ~ 91.19 GeV 

me ~ 0.511 MeV My ~ 105.7 MeV m, ~ 1.78 GeV 

Mp ~ 938 MeV M+ ~ 140 MeV ms ~ 494 MeV 

Mp ~1.22x10!9 GeV My ~ 125 GeV 1g~5.6 x 1073 GeV 

k ~ 1.16 x 104 °K/eV 1 barn = 107?4 cm? 1 yr ~ 3.16 x 107s ~r x 107 s 


For more precise values, see (Patrignani, 2016). The Planck constant is Mp = oo, where Gy is the 


gravitational constant. 


Operators and Matrices 


The commutator and anti-commutator of two operators or matrices are 
[A, B] = AB — BA, {A,B} = AB + BA. (1.2) 
The transpose, adjoint, and trace of an n x n matrix M are 


transpose: MT (MJ, = Mba), adjoint: Mt = M7™* (1.3) 


a 


trace: TrM =Ñ Ma, Tr(M,M2) = Tr (MM1), TrM =TM". (1.4) 


a=1 


Vectors, Metric, and Relativity 


Three-vectors and unit vectors are denoted by # and ĉ = Z/|zZ|, respectively. We do not 
distinguish between upper and lower indices for three-vectors; e.g., the inner (dot) product 
T- Į may be written as zy’, x'y;, or ziyi. The Levi-Civita tensor €;;,, where i,j,k = 1---3, 
is totally antisymmetric, with €123 = 1. Its contractions are 


EijkEijk = Ô, EijkEijm = 20km, €igk€imn = OjmOkn — Ôjnôkm, (1.5) 
where the Kronecker delta function is 
_jJ ltj 
bij = { O iLj (1.6) 
Eijk is useful for vector cross products and their identities. For example, 
(A x B); = eij Aj By (1.7) 


(A x B). (Č x D) = cijkeimAjBrCiDm = (4 - C)(B- D) — (4 - D)(B-C). (1.8) 


Notations for four-vectors and the metric are given in Table 1.2. 
The four-momentum of a particle with mass m is p“ = (E, p) with p? = E? — p? = m?. 
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(The symbol p is occasionally used to represent |p| rather than a four-vector, but the 

meaning should always be clear from the context.) The velocity 6 and energy are given by 
> P E 1 

Under a Lorentz boost by velocity B L (the relativistic addition of B Lto 8 , which is equivalent 

to going to a new Lorentz frame moving with — 8z) 


p" > p” = (E',p"), (1.10) 
where z 
E' =9L(E +8- P), P’ =p. +71(p) + BLE), (1.11) 
with 1 
Pi = LBL : P, PL =P- Pi JL = >>: (1.12) 
vi-t 
TABLE 1.2 Notations and conventions for four-vectors and the metric 
Contravariant four-vector A” = (A°, A), xt = (t, Z) 
Covariant four-vector Ap = Juw A" = (AP, =A), Ly = (t,-Z) 
Metric Suv = gt” = diag(1, —1, —1, —1) 
v — Abo 6” l, H= K 
AT SES Ea 
Lorentz invariant A- B= A,B" = gw A"B” = A°B°-A-B 
: : 4 — 3 _ (3 şğ = 1 Af 0 vi 
Derivatives On = 3,0 = (2.¥) , C= je = (2. -v) 
= 0,0" = & -V? 


0-A=0,A¥ =L 49.4 
a OH) = adHb — (Oa) b 


Antisymmetric tensor e#¥P? | with e0123 = +1 and e??? = —1 
Contractions chy ee pg = —24 EP PT E ypr = —697 
VY po A o W: 
grep EuvtTw = —2 (9292, E 92,97) 


Translation Invariance 


Let P” be the momentum operator, |i) and |f} momentum eigenstates, 


PH) = pil), PHILP = phl), (1.13) 
and let O(x) be an operator defined at spacetime point x, so that 
O(a) = éP *O(0)e*"*. (1.14) 
Then the x dependence of the matrix element (f|O(x)|i) is given by 
(f|O(a)|é) = e0122 FIOO) |i). (1.15) 


The combination of Lorentz and translation invariance is Poincaré invariance. 
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The Pauli Matrices 
The 2 x 2 Pauli matrices © = (01, 02,03) (also denoted by 7, especially for internal symme- 
tries) are Hermitian, o; = ol , and defined by 


[71,05] = QE GRO: (1.16) 


A convenient representation is 


T = G ae 02 = (; a , = e a (1.17) 


There is no distinction between g; and ot. Some useful identities include 


2 
Tro; = iaa = 0, Tr (ojo ;) = 26;; 
ro Xo r (o40;) y (1.18) 


{oi cj} 2ô;; I o? I, 0,05 = Oigl + i€ijkOk. 


The last identity implies 
(A.a)(B-@) =A. BI+i(Ax B) g, (1.19) 


where A and B are any three-vectors (including operators) and A-é is a 2x 2 matrix. Thus, 
(A- @)? = A?T for an ordinary real vector A with A = |A|, and 


iT — (cos AJI + i(sin A)A-@. (1.20) 
Any 2 x 2 matrix M can be expressed in terms of & and the identity by 


1 1 
M = 5 Tr(M)I + 5Tr (Ma) 3. (1.21) 


The SU(2) Fierz identity is given in Problem 1.1. 


The Delta and Step Functions 
The Dirac delta function ô(x) is defined (for our purposes) by 
+00 
J (x — a)g(a)dx = g(a) (1.22) 


for sufficiently well-behaved g(x). Useful representations of ô(x) include 


E fa 1 
—a)= — ik(a—a) gp = — li ; 1.23 
Ie- fete Ode = ie era] (123) 
The derivative of 6(x) is defined by integration by parts, 
too +% d§ (a — d 
/ 0 (x — a)g(x)dx = I G A ade Za (1.24) 
ee e dx da | za 


Suppose a well-behaved function f(x) has zeroes at xoi. Then 


=e 7 a ae (1.25) 
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The step function, O(x), is defined by 


nr fl, z>x 
o-i g eens (1.26) 
from which 6(x2) = dO/dz. 
Useful Integrals 
+oo 
Gaussian: Í eet Bt dy = elta, |Z for Rea >0 
a 
3j GF .—pl | (1.27) 
Yukawa: per i E i = a 
|z| w+ |g] 


1.1 PROBLEMS 


1.1 Let Xn, n = 1---4, be arbitrary Pauli spinors (i.e., two-component complex column 
vectors). Then the bilinear form y},,0;Xn is an ordinary number. Prove the Fierz identity 


(xdx3) © Ox) = 2r dx) 0x) — (xhx3)(xdx1); 
where np = +1. (The identity also holds for anticommuting two-component fields if one 
sets np = —1.) Hint: expand the 2 x 2 matrix yıx} in (x}x1)(xbx3) using (1.21). 
1.2 Justify the result (1.25) for d(f(a)). 


1.3 Calculate the surface area f dQ, of a unit sphere in n-dimensional Euclidean space, 
so that f dk = [dQ Jo. &” dk. Show that the general formula yields 


fan = 2, fai = 2r, fas = 4r, fan = 2n"*. 


Hint: Use the Gaussian integral formula (1.27) to integrate f d°ke-°* in both Euclidean 
and spherical coordinates. 


1.4 Show that the Lorentz boost in (1.11) can be written as 
E" \ _ (coshyz sinhyz E 
pi J  \sinhyz coshyz Pi J’ 


ipit 
YL= 5 I-G; 


where 


is the rapidity of the boost. 
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CHAPTER 2 


Review of Perturbative Field 
Theory 


DOI: 10.1201/b22175-2 


Field Theory is the basic language of particle physics (i.e., of point particles). It combines 
quantum mechanics, relativistic kinematics, and the notion of particle creation and annihi- 
lation. The basic framework is remarkably successful and well-tested. In this book we will 
work mainly with perturbative field theory, characterized by weak coupling. 

The Lagrangian of a field theory contains the interaction vertices. Combined with the 
propagators for virtual or unstable particles one can compute scattering and decay ampli- 
tudes using Feynman diagrams. Here we will review the rules (but not the derivations) for 
carrying out field theory calculations of amplitudes and the associated kinematics for pro- 
cesses involving spin-0, spin-$, and spin-1 particles in four dimensions of space and time, 
and give examples, mainly at tree level. Much more detail may be found in such field theory 
texts as (Bjorken and Drell, 1964, 1965; Weinberg, 1995; Peskin and Schroeder, 1995). 


2.1 CREATION AND ANNIHILATION OPERATORS 


Let |0) represent the ground state or vacuum, which we define as the no particle state (we 
are ignoring for now the complications of spontaneous symmetry breaking). The vacuum 
is normalized (0|0) = 1. We will use a covariant normalization convention. For a spin-0 
particle, define the creation and annihilation operators for a state of momentum 7 as a! (p) 
and a(p’), respectively, i.e., 


a(7)|0) =0, al (#)|0) = |p), (2.1) 


where |p) describes a single-particle state with three-momentum p, energy Ep = \/p? + m?, 
and velocity B = p/ Ep. We assume the commutation rules (for Bose-Einstein statistics) 


[a(@), a! (p")| = (27)?2E,8 (P-P), [aP), a(9")] = lat (8), a! ")] = 0, (2.2) 
which correspond to the state normalization 

(PIB) = (217)°2E, P(B- p’). (2.3) 

1Some formulas are simpler in the alternative non-covariant convention [an (P), al, (B^) | = 63(p—p’), with 


an (P) = a(P)/y/(271)32Ep. The corresponding single-particle state is |>)n = al,(p) |0), and the integration 
over physical states is S dp. Yet another possibility is (periodic) box normalization in a volume V = L3, 
leading to discrete three-momenta with it? component p; = ni, ni = 0,+1,+2,--- , and commutators 


[ag (P), al, (g')] = Onp- 
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This is Lorentz invariant because of the Æp factor. This can be seen from the fact that the 
integration over physical momenta is 


dp dtp 


where O©(x) is the step function and we have used (1.25). The right-hand side of (2.4) is 
manifestly invariant. The additional 2(27)? factor is for convenience. The corresponding 
projection operator onto single-particle states is 


3 => 
y= | ping PP (2.5) 


The interpretation of (2.2) is that each momentum 7 of a non-interacting particle can 
be described by a simple harmonic oscillator. The number operator N (p), which counts the 
number of particles with momentum p in a state, and the total number operator N, which 
counts the total number of particles, are given by 


3> 
NGEH) N= | ore NG) (2.6) 


(2.1)-(2.6) actually hold for any real or complex bosons, provided one adds appropriate 
labels for particle type and (in the case of spin-1, 2, ---) for spin. For real (i.e., describing 
particles that are their own antiparticles) spin-0 fields of particle types a and b, for example, 


[aa(P), a} (B")] = bav(2m)°2E, E(B- P"),  [aa(P), av(B")] = [al (B), a5 (7)] = 0. (2.7) 


Similarly, for a complex scalar, describing a spin-0 particle with a distinct antiparticle, 
it is conventional to use the symbols at and bt for the particle and antiparticle creation 
operators, respectively. (Which state is called the particle and which the antiparticle is a 
convention.) For example, for the 7* state, 


a‘ (p)|0) =|x*(8)), 1) |0) = |x“), (2.8) 
with 
[b(p'), b'(p’)] = (27)?2Ep (p-p), [a(7), b'(p")] = [a(P), b(W")] = 0. (2.9) 


The creation and annihilation operators for fermions are similar, except that they obey 
the anti-commutation rules appropriate to Fermi-Dirac statistics. The creation operator for 
a spin-4 particle is at (p, s), where s refers to the particle’s spin orientation, which may be 
taken with respect to a fixed z axis or with respect to p (helicity). Then, 


{a(p, s), at (p", s')} = alp, s)al (p", s') + al (p’, salp, s) = (27)°2E, 8 (P — B')ôss. (2.10) 
Similarly, for the antiparticle, 
{b(p, 8), b'(p’, s')} = (27) °2E, 6° (P — p" bss, (2.11) 


while 


{a,a} = {b,b} = {a,b} = {a,b} =0 (2.12) 
for all values of p, p’, s, and s’. Fermion and boson operators commute with each other, e.g., 


[aboson, Afermion] = 0. 
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Non-interacting multi-particle states are constructed similarly. For example, the state 
for two identical bosons is 


|71 2) = |Peb1) = a! (p1 )a' (G2) |0) (2.13) 
with 
(1 P2|P3p4) = (2m)? 2Ey (2)? 2E> [5° (P1 — P3)d° (B2 — Pa) (2.14) 
+ ô (Pı — ps)d° (P2 — p3)] - 
Similarly, for two identical fermions, 
|7181; D282) = —|P282; P151) = at (pi, 81)a" (p282)|0), (2.15) 
with 
(B151; P2S2|P383; Pasa) = (27)°2Ey (2m)? 2E» [5° (P1 — P3 )ðs1 s30? (B2 — D1) s254 
— & (By — P4))s, 049° (P2 — B3)0s9s5| - (2.16) 


2.2 — LAGRANGIAN FIELD THEORY 


Consider a real or complex field ¢(x), where x = (t,Z). The (Hermitian) Lagrangian density 


L(9(x), LPE), P (a), ,0" (e)) (2.17) 


contains information about the kinetic energy, mass, and interactions of ¢. We will generally 
use the simpler notation £(¢, 0,,), or just L(x), with the understanding that for a complex 
field £ can depend on both ¢ and its Hermitian conjugate ¢'. Equation (2.17) is trivially 
generalized to the case in which there is more than one field. It is useful to also introduce 
the Lagrangian L(t) and the action S by integrating £ over space and over space-time, 
respectively, 


L(t) =f a2 L(>, pH), s-{~ dt L(t) = | de L(o, 0,4). (2.18) 


The Euler-Lagrange equations of motion for ¢ are obtained by minimizing the action with 
respect to (x) and t(x), 
ôL ôL 


36 50,6 = 0, (2.19) 


and similarly for ¢'. The fields ¢ are interpreted as operators in the Heisenberg picture, 
i.e., they are time-dependent while the states are time independent. Other quantities, such 
as the conjugate momentum, the Hamiltonian, and the canonical commutation rules, are 
summarized in Appendix A. 


2.3 THE HERMITIAN SCALAR FIELD 


A real (or more accurately, Hermitian) spin-0 (scalar) field satisfies (x) = t(x). It is 
suitable for describing a particle such as the 7° that has no internal quantum numbers and 
is therefore the same as its antiparticle. 
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2.3.1 The Lagrangian and Equations of Motion 


The Lagrangian density for a Hermitian scalar is 


£($, 048) = 5 [(8n8)? - m6] — Vrlo), (2.20) 


where (0,,¢)* is a shorthand for (0,,¢)(0"¢). The first two terms correspond, respectively, 
to canonical kinetic energy and mass (the $ is special to Hermitian fields), while the last 
describes interactions. 

The interaction potential is 
A g : 
har r+ c+dio+ ds zy +:-- + non — perturbative, (2.21) 
where the k! factors are for later convenience in cancelling combinatoric factors,? and “non- 
perturbative” allows for the possibility of non-polynomial interactions. The constant c is 
irrelevant unless gravity is included. A non-zero dı (tadpole) term will induce a non-zero 
vacuum expectation value (VEV), (0|¢é|0) 4 0, suggesting that one is working in the wrong 
vacuum. The dı term can be eliminated by a redefinition of ¢ > ¢’ = constant + ¢, as will 
be described in Chapter 3. £ and ¢ have dimensions of 4 and 1, respectively, in mass units, 
so the coefficient of œ% has the mass dimension 4— k. The dy terms with k > 5 are known as 
non-renormalizable or higher-dimensional operators. They lead to new divergences in each 
order of perturbation theory, with dẹ typically of the form dp = c,/M*~4, where cp is 
dimensionless and M is a large scale with dimensions of mass. Such terms would be absent 
in a renormalizable theory, but may occur in an effective theory at low energy, where they 
describe the effects of the exchange of heavy particles (or other degrees of freedom) of mass 
M that are not explicitly taken into account in the field theory. (In Chapters 7 and 8 we 
will see that an example of this is the four-fermi operator that is relevant to describing the 
weak interactions at low energy.) Keeping just the renormalizable terms (and c = dı = 0), 
one has 

ne CA 


Vi(9) = T 


where « (dimensions of mass) and A hue aiee describe three- and four-point interac- 
tions, respectively, as illustrated in Figure 2.1. From the Euler-Langrange equation (2.19), 
one obtains the field equation 


+A (2.22) 


2 P g 

H H = Hm —+A—=0, 2.23 
(D +m) 6+ i= (tm) e405 ta (2.23) 
where 04+m? = 0,,0"+m? is the Klein-Gordon operator. The expression for the Hamiltonian 
density is given in Appendix A. 


2.3.2 The Free Hermitian Scalar Field 
Let ġo = oi, be the solution of (2.23) in the free (or non-interacting) limit k = A = 0, i.e., 
(CO +m?) do(x) = 0. (2.24) 


Equation (2.24) can be solved exactly, and small values of the interaction parameters « and 
A can then be treated perturbatively (as Feynman diagrams). The general solution is 


3 > 
doa) =la) = farang [alpen + al ers], (2.25) 


?Conventions for such factors may change, depending on the context. 
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Figure2.1 Three- and four-point interactions of a Hermitian scalar field ¢. The factor 
is the coefficient of 6” /n! in i£, as described in Appendix B. 


where z = (t,%) and p = (E,,p) with Ep = \/p?+m?, i.e., the four-momentum p in 
the Fourier transform is an on-shell momentum for a particle of mass m. The canonical 
commutation rules for ġo and its conjugate momentum given in Appendix A will be satisfied 
if the Fourier coefficients a(p') satisfy the creation-annihilation operator rules in (2.2). 

It is useful to define the Feynman propagator for ġo, 


iAr(x — x’) = (0|T [¢0(x), ¢o(x")]|0), (2.26) 
where 
T [¢0(x), G0(a’)] = O(t — t')b0(x)bo0(2") + OC — t)bo(2x")Go(a) (2.27) 


represents the time-ordered product of ¢o(x) and ¢o(a’). In (2.27) O(t — t’) is the step 
function, defined in (1.26). Ar(a — 2’) is just the Green’s function of the Klein-Gordon 
operator, i.e., 


(A, +m?) Ap(x — 2’) = —64* (x — 2’), (2.28) 
where O, refers to derivatives w.r.t. x. The momentum space propagator is 
Ar(k) = [dive Ae) = : = : : (2.29) 
k2 — m2 + ie k2 — k? — m? + ie 


k is an arbitrary four-momentum, i.e., it need not be on shell. The on-shell limit is correctly 
handled by the ze factor in the denominator, where € is a small positive quantity that can 
be taken to 0 at the end of the calculation. 


2.3.3. The Feynman Rules 


The Feynman rules allow a systematic diagrammatic representation of the terms in the 
perturbative expansion (in «s and A) of the transition amplitude Mp; between an initial 
state 7 and a final state f. The derivation is beyond the scope of this book, but can be 
found in any standard field theory text. (The derivation of a simple example is sketched in 
Appendix B.) Heuristic derivations may also be found, e.g., in (Bjorken and Drell, 1964; 
Renton, 1990). For the Hermitian scalar field with the potential (2.22), the rules are 


Draw each connected topologically distinct diagram in momentum space correspond- 
ing to initial (final) states i (f), with internal lines corresponding to virtual (inter- 
mediate) particles. The internal and external lines are joined at three- and four-point 
vertices corresponding to the interactions in V;. Each external and internal line has 
an associated four-momentum, which is off-shell for the virtual particles. It is conve- 
nient to put an arrow on the line to indicate the direction of momentum flow. This 
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direction is only a convention, and for the Hermitian scalar field (with no internal 
quantum numbers) there is no restriction on how many arrows flow into or out of a 
diagram. 


There is a factor of —ik at every three-point vertex and a factor —i\ at each four- 
point vertex, as in Figure 2.1. These correspond to the coefficients of ¢°/3! and ¢4/4!, 
respectively, in i£, with the 1/3! for « cancelling against 3! ways to associate the three 
lines with the three fields in ¢?, and similarly for the À term. 


There is a factor of iA p(k) = for each internal line with four-momentum k. 


i 
k2—m?2+ie 
Four-momentum is conserved at each vertex, implying that the overall four-momentum 
is conserved (i.e., My; is only defined for Up; = Up,). 


Integrate over each unconstrained internal momentum (there will be one for each 
internal loop in the diagram), with a factor f okt. 

There may be additional combinatoric factors? associated with the interchange of 
internal lines for fixed vertices, e.g., a factor of 1/n! if n internal lines connect the 
same pair of vertices, as in Figure 2.2. 


Figure 2.2 Diagrams with additional factors of 1/2! (left) and 1/3! (right), required 
because not all of the (4!)? ways of associating the four fields at each vertex with 
four lines lead to distinct diagrams. 


The ordering and arrangement of the external lines in a Feynman diagram is usually irrele- 
vant, although the relative ordering between two diagrams does matter for fermions. In this 
book we will usually, but not always, place the initial state particles at the bottom and the 
final particles at the top. 

As a simple example, the tree-level diagrams for the 2 — 2 scattering amplitude 
My; = (p3p4|M|pip2) are shown in Figure 2.3. Applying the Feynman rules, these diagrams 
correspond to the expression 


My; = ià + (ik)? (2.30) 


s—m? Tiea u- m? 
where s, t, and u are the Mandelstam variables 
s = (pı + pe)? = (p3 + pa)? 
t= (pı — ps)” = (pa — p2) (2.31) 
u = (pı — pa}? = (ps — pe)”. 


3In general, there may be subtleties involving combinatorial factors (or, signs when fermions are involved), 
especially in higher-order diagrams, which are best resolved by returning to the original derivation. 
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For the present (equal mass) case, s = E2,, > 4m?, t < 0, and u < 0, where Ecm is 
the total energy in the center of mass. The internal lines are never on-shell (s,t,u 4 m?) 
for physical (on-shell) external momenta, so one can drop the +e. It is implicit that the 
external momenta satisfy the four-momentum conservation pı + po = p3 + p4. The second, 
third, and fourth diagrams in Figure 2.3 are said to have s-channel, t-channel, and u-channel 
poles, respectively. 


P3 P4 P3 Pa BB ; P4 
` e ~ —ik 
` - ` - v. y 
Y. nA ~~ [K `w” 
s + ` * I 
A = =i “se! + À iA p(p1 + p2) 
z” Pa è 
A v, ee N ae 
d ~. ~~ ` Pia 7 —ik v. r 
Pı P2 Pi P2 Pi P2 
P3 P4 Pa P3 
+o =i W- ik + tk W--m--6 ir 
4 4F Pid) d Ara N 
r “i Fd A 
Pi P2 Pi P2 


Figure 2.3 Tree level diagrams for Mp; = (p3p4|M |p p2) for a Hermitian scalar field. 
The arrows label the directions of momentum flow, and p;; = p; — pj. The arrange- 
ment of lines in the last diagram is modified to allow the diagram to be drawn 
without crossing lines. 


2.3.4 Kinematics and the Mandelstam Variables 


Let us digress to generalize to the case of a 2 — 2 scattering process 1+2 — 3+4, where 
we allow for the possibility of inelastic scattering with unequal masses for 1, 2, 3, and 4. 
In the absence of spin, the scattering amplitude can be expressed in terms of the Lorentz 
invariant Mandelstam variables defined in (2.31). s, t, and u are not independent, but are 
related by 
s+t+u= m] +m +m +m. (2.32) 

Of course, s = m? + m3 + 2p1ı - pa, ete. 

The kinematics is simplest in the center of mass (CM) frame, which is more accurately 
the center of momentum, in which the total three-momentum of the initial and final state 
vanishes: 


Pı = (E1, Pi), P2 = (E2, = Pi), P3 = (E3, pf), P4 = (E4, —pf), (2.33) 


where p; and py are, respectively, the initial and final three-momenta; the energy and 
velocity of particle 1 are 


E =p? +m, ĝ= ae (2.34) 
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and similarly for 2, 3, and 4; and the CM scattering angle @ is related by 
Pi: Pf = pips cos, (2.35) 


as shown in Figure 2.4. In the CM frame, s = (E1 + E2)? = (E3 + E4)? is just the square 


= 3 3 2 
23 Pf 
Pi 0 1 
1 S 
—Pf 


(ii) 


Figure 2.4 Scattering kinematics in (i) the center of mass, (ii) the lab, and (iii) the 
Breit frames. 


of the total energy. In the physical scattering region, s > (mı + m2)? and s > (m3 + m4}. 
Using p2 = pı + p2 — pı one finds 


2 2 
s+mi-— ms V/s 


E = 2.36 
i 2/8 IRS 2 : ( ) 
so that 
1/2 1/2 
pia pea [s — (mı + m2)?] ‘~ [s — (mı — m2)? ] 
i= 1 15 
ays (2.37) 


with similar expressions for the other particles. This is sometimes written as 


AM? (s, mz, m3) 


Pi = Mr ; (2.38) 


where 


2 O° iy 


Hy“ + 2° — Qay — 2rz — 2yz. (2.39) 


A(x, y, z) =x 


The ¢ and u variables, which describe the momentum transfer between particles 1 and 
3 and between 1 and 4, respectively, are given in the CM by 


0 
t =m? + m3 — 2E, E + 2pips cos 0 ———+ —2p7(1 — cos 0) = —4p° sin? 5 <0 
pig Ae) 
aie p (2.40) 
u =m + mi — 2E, E4 — 2pipf cos 0 ———> —2p?(1 + cos 0) = —4p? cos? 5 <0, 
miı=m4 
m2 =mM3 


where the last expressions are for elastic scattering (mı = m3, M2 = M4 or Mı = Ma, 
mə = mz), for which p; = pr = p. Note that t and u are negative at high energies (e.g., the 
last expressions in (2.40) hold with p ~ ys/2 when the masses can be neglected), but may 
be positive at low energies if the masses are not all equal. 
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Fixed target experiments are carried out in the laboratory frame, in which 2 is at rest, 


> 


pı = (E1, Pı), po = (m2,0), (2.41) 
with 
s- m? -må 
pa miom (2.42) 


2M 


A sometimes useful relation between the CM and laboratory variables is 
pi = |Pilme/V/s, (2.43) 
where p; is the CM momentum in (2.37). From energy and momentum conservation, 
Ez + E4 = Fi + mo, Pal? = |211? + |231? — 21p1 |123] cos 03, (2.44) 


so that Ez and E; can be expressed in terms of s and the laboratory scattering angle 03 for 
particle 3. The relations between the laboratory variables and t and u are straightforward. 
For example, 


t = m3 + mi — 2E, E3 + 2|p1||p3| cos 03, u = ms + mî — 2m2 E3. (2.45) 


These formulae become especially simple in the special case mı = m3 = 0 and mz = ma, 


e.g., 
P3 1 
pı 1+ (1 — cosb)’ Pips(1 — osta ee) 


with pı = |pi| = E, and p; = |p3| = E3. 

Yet another frame, especially useful for theoretical purposes, is the Breit or brick wall 
frame (e.g., Hagedorn, 1964; Renton, 1990), in which the scattered particle 3 simply reverses 
the direction of 1. For mı = mg, the momenta are 


Pı = (E1, P), P3 = (E1, —p), (2.47) 


so that 
t = —Alp|?. (2.48) 


We will see an example in discussing the simple parton model in Section 5.5. 


2.3.5 The Cross Section and Decay Rate Formulae 


In this section we sketch the derivation of the relation of the transition amplitude Mp; to 
the cross section or decay rate. The results apply to any field theory, not just Hermitian 
scalars. 


Two-Body Scattering 


First consider the cross section for the 2 + n process i > f, where |i) = |pipo) and 
|P) = |B +--+ Bta), as shown in Figure 2.5. (Particle-type and spin labels are suppressed.) 

As described in Appendix B, the transition matrix element Uy; and transition (scatter- 
ing) amplitude M,; are related by (B.1), so the transition probability is 


2 


Ural? = |(2r)*54(S pis — pı — po) Mil (2.49) 
k 


16 The Standard Model and Beyond 


Figure2.5 The 2 — n scattering process pip2 — Prit’ pf, (left); 1 > n scattering 
pı — PfP, from a potential, represented by a cross (middle); and the 1 > n 
decay process pı — py,-:-py, (right). 


To interpret the square of the delta function, it is convenient to temporarily assume a finite 
volume V for space and a total transition time T, which can be taken to oo at the end. 
Then 


2 2 
= y dag et © r Pty P172) 2 
V,T 


(2m)464 (© Pf, — Pi p2) (2.50) 
3 l 


= VT (2r)*8t (~~ Pf, — P1 — p2) ; 
k 


where we have used one delta function to replace the integrand of the other integral by 
unity. 

A scattering cross section is defined as the transition rate divided by the relative flux. 
The differential cross section to scatter into [],, d°pp, is therefore 


aH ate) ate) 
T |B, —Bo| 25V) \2EaV 


= (On) Oe Pf, — Pl — p2) I Bop, 
4E E2| G1 — 62| pr 2M) 2Ep, 


n 


ll Bor, V 


2E, V (27)? 


k=1 


(2.51) 


|M jil’. 


In the first line, |Up;|?/T is the transition rate, |51 — B2|/V is the relative flux, the 1/2EV 
factors correct for the normalization of the covariant states, and the n factors of V/(27)° 
represent the density of momentum states of the n final particles. Note that the factors 
of V and T cancel in the final expression in (2.51), and that all the particles are on-shell 
(E2 = P? + m2). The flux factor |G; — 82| is evaluated using 8; = p; / E}. 

It should be intuitively clear that the differential cross section is the same in the lab 
frame and in collinear frames related to the lab by boosts along the pı direction, such as 
the CM frame, but is not the same in arbitrary frames. This can be seen by the fact that 
the factor Fy F4| Bi — B| in the denominator is equal to the Lorentz invariant quantity 
[(p1 - p2)? — m?m2]!/2 in collinear frames. In fact, the cross section formula is often written 


4The inner product (p|p’) = (27)32E, 8 (P — p’) = 2Ep f deei P-P )z goes to 2EpVôpp in a finite 
volume V. 
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in terms of that quantity, though that is only strictly valid in the collinear frames.° The 6+ 
function and the scattering amplitude My; in (2.51) are manifestly Lorentz invariant. The 
final state phase space factors are also invariant, as can be seen in (2.4). The differential 
cross section can be integrated over the ranges of momenta of interest. The total cross 
section for the specific process is obtained by integrating over all final momenta, 


a= || Sı | do, (2.52) 
s 


where the statistical factor S; = 1/l! must be included for any set of l identical final particles 
to avoid multiple counting of the same final state. 

Now, consider the example of 2 > 2 scattering in the CM, as illustrated in Figure 2.4(i). 
For a given CM energy ys = Ecm, the initial and final momenta p;, pf, and the energies 
Ea, a = 1---4, are fixed by (2.36) and (2.37). It is convenient to introduce spherical 
coordinates, with the z axis along pi, so that py has polar angle 0 and azimuthal angle 
y. For spin-0 particles (or unpolarized initial particles with non-zero spin) the scattering 
amplitude My; is independent of p by rotational invariance. 

The differential cross section is given by 


2 454 _ _ dp. ap, 
do = ( T) ô (p3 tpa pı p2) Ps - |M pil? (2.53) 
AF, E2|8, — 62l (27)2E3 (27)32E4 
In the CM, 
-o ft. 1 
FE, E2| 61 — B2| = E1 E2 |P; T + m) pi(Eı + E2) = pi v's. (2.54) 


The implicit p4 integral can be done using the four-momentum conservation, 


dp: ay, ae 
Misti pi — p) a + E, — a 
Es E4 E3 E4 (2.55) 
2d i 
= 2r dcos 0 6(E3 + E4 — vs) P3 EA 
Ez E4 


where E; and E4 represent \/p32 + m3 and \/p3 2 + m3, respectively. In the last expression 

the azimuthal angle has been (optionally) integrated over, i.e., d?p3 = dy dcos@ p3 dps > 

2r dcos p3 dp3. The remaining delta function determines p3. Using p3dp3 = E3dE3 = 

E, dE,, d(E3 + E4)/dE3= 1 + E3/E,, and the identity (1.25), the quantity (2.55) is equal 
to 

Pf 
2r dcos@ —=. 2.56 
ndeost ZL (2.56) 


The differential cross section is therefore 


do 1 pf 4 | Mil? 
= Mrl ———> . 
dcos@ 3275 pi |M mı=m3 3278 


m2 =M4 


(2.57) 


The initial momentum p; is given in (2.37), while py is of the same form with mı > m3 and 


5In applications to astrophysics one is usually interested in the thermal average of o|B1 — Bol, so the flux 
factor cancels (e.g., Kolb and Turner, 1990). 
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mz — m4. The last expression in (2.57) is for elastic scattering, for which p; = pr. Closely 
related quantities are 


do 1 do do 1 do 


= = 2, 
dt 2pipy dcos? dQ 2m dcosð’ (2.58) 


where t is the Mandelstam invariant given by (2.40), and dQ = dọ dcosð is the solid angle 
element. The second form is useful if dy is not integrated over. 

In our example of the Hermitian scalar field with V; given by (2.22), the differential 
cross section at tree level is obtained from (2.30) and (2.57), 


2 


do 1 2 1 1 1 
= | } 2; 
dcos@ 327s ube En atl , eee) 
where 
s= Ebm =4(p? +m’), t=-—2p(1— cos), u=—2p?(1+cosé), (2.60) 
and p = p; = pr. The total cross section is then 
1 (+! do ti do 
=- —— dcos = ——_ 6 2.61 
° 1 dcos6 0 o dcos A oor 
where the 1/2 is because the final particles are identical. 
One can also use (2.53) to calculate the cross section in the lab frame, using 
Ey B2|81 — b| = E1m2|ĝ1| = prime. (2.62) 


The phase space integral can be carried out explicitly in the lab frame, or can be obtained 
by Lorentz transforming the CM result. In analogy with the derivation of (2.56) 


Pp, dp. 27|p3| dcos 6 
6* (ps + p4 — pı — p2) ee IPs! 2 (2.63) 
Es Ea fg, F 4 Es 


dE; 


where 63 is the laboratory scattering angle of particle 3. dE,/dE3 can be calculated using 
the second equation in (2.44), yielding 


da _ |M pil? P3 


= : 2.64 
dcosĝs  32rpımə Ei + ma — cos 43 ( ) 
Using (2.46) this implies 
d Mpil? : 
Tos (es (2.65) 
dcos@3 32ams \ pı 


for the special case mı = m3 = 0 and mz = m4. 


Potential Scattering 


Consider the 1 — n scattering of a single particle from a static source (i.e., potential 
scattering), as illustrated in Figure 2.5. This may be an approximation to scattering from 
a heavy target particle. In particular, suppose £ contains an interaction term 


L1(x) = Lp(x) ®(0,2Z), (2.66) 
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where ®(0, 7 ) is the static source and £p involves ordinary quantum fields. Then, in analogy 
with Equation (B.4) from Appendix B, the tree-level transition amplitude for i > f, with 


li) = |1) and |f) = [Pr +++ Brn) is 


U= f de (FliLp(@)l) 0,2) 


1 Pare a 
aap | ae dq (f\iLp(0)|t) ®(Z)e (ps—P1—4) ; 


(2.67) 


where 


(g) = fèr F260, z’) (2.68) 


is the Fourier transform of ®, q = (0,7), and we have used translation invariance for the 
matrix element. Then, carrying out the x integral, 


Upi = 2n6 (Ey — Ey) Mji, (2.69) 


where 
Mpi = (fliLp(0)|t) © (By — D1) (2.70) 


and pr = +, Pfa. Proceeding as in (2.51), the differential cross section is 


_ Ual? V ( 1 ) I app, V 
T B] VEV) | 44 28;,V 2r? 


_ 2rô(Fz — Fi) i dar, Wa 
2Fy|54| z (207)? 2Ep, 


(2.71) 


so that energy but not 3-momentum is conserved, as expected. For the important special 
case of elastic scattering, i.e., n = 1 with m2 = mı and p2 = pf, 


Pi2| = BEL2 =p, with 6 = |Â]. (2.72) 
But, 
ô (E> = E) dpo = p2 Eod cos Ody, (2.73) 
where 0 and y are the polar and azimuthal scattering angles. Therefore, 
do 1 do 1 
= M;;|* = —|Myi|’. 2.74 
dcos Ody T67?! til ”  deosd Sa fil (27a 


The last form holds when Mp; depends only on 
- 5 ee . 20 
||? = |po — p|? = 2p?” (1 — cos 0) = 4p? sin? 5? (2.75) 


as in the case of spinless or unpolarized particles and a radially symmetric source. As a 
simple example, consider 


1 
Crs -30 Or), (2.76) 
where ®(r) is the spherically symmetric Yukawa potential 
-ir 7 4 
30,7) =k > á öğ) ie (2.77) 


r PFP 
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with r = |Z |. ® can be thought of as arising from the exchange of a heavy scalar of mass ju 
between a nucleus and the scattered particle described by ¢. (This could be an approximate 
model for the contribution to pion-nucleon scattering from the exchange of a heavy scalar 
resonance, such as the ø or the fy(980).) From (2.70) and (2.74) we obtain Mp; = —i®(@) 


and 
do QrK? 


deos (uw? +P 


(2.78) 


Decays 


One can also consider the 1 —> n decay process with |i) = |p) and |f) = |p, Bsn) (Figure 
2.5). Similar to (2.49) and (2.51), one has 


|U]? = VT(2m)464 93 Pf, — pı) Wek (2.79) 
and differential decay rate 


1 Url? T PPp 
dr = d = = 
T T JEV lI a 2E f V 27) 


(2 m)*6 Wen Z d? Pre 2 
Ml, 
: all (27)22Ey, [Mil 


(2.80) 


rest frame 2m fi 


where T is the decay rate of particle 1 and 7 is its lifetime, and the second expression is 
specialized to the rest frame of the decaying particle. The total decay rate is obtained by 
integrating over the final particle phase space, 


r=]]s, / a, (2.81) 
IIs | 


where S; = 1/I! is a statistical factor for l identical particles, analogous to (2.52). 
For a 2-body decay, 1 > 2 + 3, Equation (2.80) simplifies to 


(27)46*(p3 +p2— pı) dpe d? p3 


dr = Mrl? 
2m1 (27)32 E2 (Om)22B, sil aea 
|M]? [p2E2dE> l 
= ô(E2 + Ez — dQ = —*? |M;l?dQ, 
3272m; | E2E3 (Fa + Ea = ma) 327 a = nl? 


where E23 = 4/ p2? + m33 in the second line. The quantity in square brackets evaluates to 


p2 _ [mt — (m2 + m3)?| me [mi — (m2 — map? (2.83) 


mı 2m? 


where the expression for pz is analogous to (2.37). 

As an example, consider three distinct Hermitian scalar fields ¢;, i = 1---3, with masses 
m,;. If my > M2 + mz it is possible for particle 1 to decay into 2+ 3 as shown in Figure 
2.6, provided there is an interaction term to drive the decay. The simplest appropriate 
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m a-m 


Figure 2.6 The two-body decay of the spin-0 scalar 1 into 2 + 3, and the associated 
Feynman diagram corresponding to (2.84). 


Lagrangian is 
3 
j 
L= >I 5 (anpi)? — M? E7] — Kb1¢2ds, (2.84) 
i=1 
where the kinetic energy and mass terms generalize (2.20). No counting factor is needed 


in the interaction term because the fields are distinct. The tree-level decay amplitude is 
therefore 


Mpi = ~ir, (2.85) 
i.e., the coefficient of ¢,¢2¢3 in i£. The decay rate is therefore 
k?p2 k?p 
rT = —— |Q= : 2.86 
3272m? f 8rm? ( ) 
An 


More general techniques for evaluating phase space integrals are discussed in Appendix 
D and in (Barger and Phillips, 1997). 


2.3.6 Loop Effects 


In addition to the tree diagrams shown in Figure 2.3 for the potential in (2.22), there are 
a large number of loop diagrams. A sample of the many one-loop diagrams is shown in 
Figure 2.7. Of course, the relative strength of these and higher-loop diagrams depends on 
the magnitudes of À and k?. 

The first diagram is shown in more detail in Figure 2.8, with the momenta labeled. The 
external lines are on-shell, while the two internal lines are not. Four-momentum is conserved 
at each vertex, so only the single momentum k is integrated over. From the Feynman rules, 
this diagram contributes 


(—id)? Í dtk i i 
M = 2; 
2 (27)* k? —m2+ie (k— pi — po)? — m? + ie 87) 


to the scattering amplitude, where the 4 is due to the identical particles. The techniques for 
evaluating such integrals will be illustrated in Appendix E and are developed in detail in field 
theory texts. Here we just note that the integral is logarithmically divergent for large k (the 
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Figure 2.7 Examples of one-loop diagrams contributing to the process pyp2 — p3p4. 


integrand goes as d*k/k*). Fortunately, the theory is renormalizable,° so the divergences can 
all be absorbed in the observed (“renormalized”) values of m, A, and the “wave functions,” 
leading to finite and calculable quantities. The pure ¢° theory (i.e., A = 0, k #0) is super- 
renormalizable: the only divergent diagrams are the two one-loop diagrams shown on the 
right in Figure 2.8. (These may appear as components of larger diagrams.) It should be noted 


=a = 
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Figure 2.8 Left: A typical one-loop diagram, with the momenta flowing through each 
line labeled. Right: Divergent (sub)diagrams in the super-renormalizable ¢° theory. 
The second (tadpole) diagram contributes to the dı term in (2.21) and must be 
included in the field redefinition that eliminates it. 


that the modern view of divergences is that the integrals are not truly infinite. Rather, they 
reflect the view that a low energy field theory is an approximate description valid below some 
energy scale A, such as the Planck (gravity) scale Mp = ew ~ 1.2 x 10/9 GeV, above 
which there is a more complete theory. A logarithmically divergent integral such as (2.87) 
is therefore proportional to A? In(A/m). As long as this quantity is small, the sensitivity 


to the new physics scale is not great. In addition, the leading logarithmic contributions 


6In a renormalizable theory all divergences can be absorbed in a finite number of quantities, which can 
either be measured or drop out of final expressions for observables. Non-renormalizable theories, on the 
other hand, encounter new divergences at each order in perturbation theory. 


Review of Perturbative Field Theory 23 


can be summed by the renormalization group equations. This will be discussed further in 
Section 2.12.2. 


2.4 THE COMPLEX SCALAR FIELD 


Now let us consider the complez, i.e., non-Hermitian, scalar field ¢ 4 ¢', which describes a 
spin-0 particle that is not the same as its own antiparticle. For example, ¢,+ annihilates a 


mt or creates a 7, while ¢,- = pla is the corresponding z~ field. 
The Lagrangian density for a complex scalar field is 
L= Lİ = (3 p (OH) — mG- Vi(9, 9"), (2.88) 


where the (Hermitian) potential V; is given by’ 
ty = A(6tey? 
V7(¢,¢') = 1? $)“ + non-renormalizable. (2.89) 


There is no ł in the kinetic energy and mass terms for a complex field, and the 1/4 in Vy 
is for later convenience. Keeping only the quartic term in Vz, the field equation is 


ôL g L 
õpi HS0, 


=0 (A +m?) d+ Zoloto) =0. (2.90) 


A complex scalar field ¢ can always be expressed in terms of two real fields 6; = dl, 
i= 1,2, by® 


1 
o= 5 (%1 + ide), ý = a (1 — ig). (2.91) 
In terms of these, £ in (2.88) can be written 
z Ly A 
L=) 5 [Ou — moi] - gt. (2.92) 


i=1 
This is not the most general renormalizable Lagrangian for two real scalars because the 
masses are the same and the potential is of a special form that leads to a conserved quantum 


number. 
The solution for the free complex field (i.e., A = 0) is 


dp EO = ape 
polz) = [a(p)e Up x +b (p)e up zi 
J ene (2.93) 


where a (p) and b (p) create t+ and m~ states, respectively, as in (2.8). They satisfy the 


’ There are other possible renormalizable terms in Vz, such as on[(¢)" +(')"], n = 1---4, or pm[(o)™ + 
(pt)™]¢t¢, m = 1,2. These would not allow a conserved charge, and would lead to processes such as 
ntrt — n n (Problem 2.6). Equivalently, they would not exhibit the U(1) phase symmetry discussed in 
Section 2.4.1. 

8It is a convention whether ¢ = (¢1 +i¢2)/V2 is identified as ¢, +, as is done here, or with ¢,— . In later 
chapters we will often take the opposite choice, especially to be consistent with isospin conventions. 
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commutation rules in (2.2) and (2.9). a and b can be expressed in terms of the creation 
operators for the real fields ¢19 and ¢29 by 


1 1 
ap) = -pla (P) + ta2(P)], bØ) = ela) — taa(P)] 
e i (2.94) 
'@p) = geie) - iaè(p)], (p) = geie) + iab (p)]. 
The Feynman propagator for the complex field is 
4 e~ ik: (x—-2') 
OT) AEO = iare- al) =i f ST, C) 


so that an internal line carrying momentum k in a Feynman diagram is associated with the 
factor i^ p(k) defined in (2.29). 

In Feynman diagrams, the arrows represent the direction of flow of positive charge. For 
example, an external line with an arrow going into the diagram may represent either an 
initial state 7+ or a final state m~, and conversely for an arrow leaving the diagram. Each 
vertex is associated with a factor —i\ and must involve the same number of entering and 
exiting lines (charge conservation). It is conventional to label internal momenta so that the 
arrow also coincides with the direction of momentum flow, while the momenta for external 
lines are the physical momenta, which are entering the diagram for initial particles and 
exiting for final particles. Thus, the physical momentum flows opposite to the arrow for a 
am, as illustrated in Figure 2.9. We also introduce the symbol p for the momentum flowing 
in the direction of the arrow, i.e., Dy+ = Epr+. 


nt (Bs) n (Pa) T(P) n` (Pa) 
DA a » tAr(k + pi — ps)” 
. ye. OZ 
ia x =i Ow X —ià 
€ a O Me 
P. iAr(k) 
nt (Pi) n` (P2) n+ (pi) n (P2) 


Figure 2.9 Left: Tree-level diagram for n+ (p1) 7 (p2) > n*(p3) n~ (p4), where 5,2 
(p3,4) are the physical initial (final) momenta. Right: An example of a one-loop 
diagram. Ap(k) is defined in (2.29). 


2.4.1. U(1) Phase Symmetry and the Noether Theorem 


The Lagrangian density in (2.88) is invariant under the phase transformations 


G(x) + $'(a) = e 6(x) ——> 4(2) + 69(2), (2.96) 


B small 


where 6¢(x) = i6¢(x), which means that 


L(¢, t) = L(g, 6"). (2.97) 
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The invariance holds for all £, i.e., £ has a U(1) symmetry group. U(1) is the group of 1 x 1 
unitary matrices, which is a fancy expression for phase factors. The symmetry is global,’ 
i.e., 3 is a constant, independent of x. 

According to the Noether theorem, any continuous symmetry of the action leads to a 
conserved current and conserved charge. For example, translation invariance of the action 


under x, — i, = £, — a, leads to energy and momentum conservation, with the four- 


momentum operator P” = (H, P) in (A.5) on page 507 conserved (see, e.g., Bjorken and 
Drell, 1965). In the present case, invariance of £ automatically implies that the action is 
invariant also. As will be shown below, the Noether theorem then implies that 


ÔL = —Bd,,J", (2.98) 


where 


is the change in the Lagrangian density under (2.96) for infinitesimal 8. In (2.98), J” is the 
conserved Noether current 


Jt = —i[(a"o)'o — pao] = igt Fo. (2.100) 


s? 
In the last expression, we have introduced the symbol 0”, defined by 


a Ob = ad"b— (la) b (2.101) 
for any a and b. 
The Noether charge 
Q= J ËZ J? (x) (2.102) 


is then conserved, assuming there is no current flow at infinity, because 


0 0 Sy 
OE sss pig OF ASD pez OF) + F. Ja) 
bose oot (2.103) 


integral =0 
z jez ð J” (x) = 0. 


This is most easily interpreted by explicitly calculating Q for non-interacting fields ġo, for 
which one has easily that 


dp 
[at Salo) — blis)b(a)) = = 
Q= | ar, lah Paw) -OWP = Ne — Nee. (2.104) 
That is, the conserved Noether charge of a state is just the total number of particles minus 
the number of antiparticles, which coincides with electric charge for the specific example. 
It is straightforward to show that the conservation law continues to hold in every order of 


perturbation theory provided that the interaction term respects the symmetry (commutes 
with Q). 


The derivative terms in £ would not be invariant for 8 = B(«). Later, we will discuss how to modify £ 
to yield local (gauge) symmetries under which ¢(x) > e#?) g(a). 
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Derivation of the Noether Current for the Complex Scalar Field 


Under the infinitesimal phase transformation (2.96), the change in the Lagrangian density 
is 


_ ôL a on 
å Laa io)’, A 
where 
õp(x) = 189(2), 5o(x)' = —ibel(x)' 
5(Iu6) = 18 0,6(2), 5(0.6)' = 18 (Qu0(x))* . (2.106) 
Using the Euler-Lagrange equations (2.19), this is 
L=ip a ( of 6) Oy, e 6) | = — bô, J" (2.107) 
*\5(Ou) ô(3 p)! “i 
where the Noether current is 
ôL 4> 
iat t| = igt Ote. 2.108 
a A aie 


Finally, for a symmetry!?, one has in addition that the Lagrangian density is unchanged, 
i.e., ĜL = 0, so that 0, J" = 0. 


2.5 ELECTROMAGNETIC AND VECTOR FIELDS 


Define the electromagnetic vector potential A” (x) and the field strength tensor 


F" = — F”! = Oh AY — OV AP, (2.109) 
related to the electric and magnetic fields E and B by 


0 22. -E, =z, 
Es 0 -B, B 


w 
F" = E, B, 0 ap, (2.110) 
E, —B, Bz 0 
In the absence of sources, one has the free-field Lagrangian density 
1 v l / p2 D2 
Lol A") = -3 Fw F” = 5 (Z -B ) i (2.111) 


leading to the free-field equation of motion OA” — ð! (0, A”) = 0. (We have omitted the 
free-field subscript 0 on F and A for simplicity.) 

Lo is gauge invariant, i.e., FWY = FWY = OHA” — O” A'!, and therefore Lo(A") = 
L(A’), where 


Alt = AM Larga), (2.112) 


10 The condition for a symmetry is actually weaker. It is sufficient for the action in (2.18) to be invariant. 
We will see examples of this for discrete symmetries in Section 2.10 and Chapter 7. For a continuous 
symmetry, it is sufficient to have ôL = —80,a", where a“ transforms as a Lorentz four-vector. In that case, 
the current J” — a is conserved. 
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and (x) is an arbitrary differentiable function of (t,7). (The 1/e factor, where e > 0 is 
the electric charge of the positron, is inserted for convenience.) This is known as a U(1) 
invariance because there is only one function (x). The U(1) gauge invariance will con- 
tinue to hold for the full Lagrangian including interactions. It is convenient to work in the 
Lorenz gauges, 0, AY = 0. This still leaves the freedom of making a further gauge trans- 
formation provided O68 = 0, and one can exploit this freedom to simultaneously choose 
A? =0and Ñ. A= 0, the Coulomb or radiation gauge. The radiation gauge does not have 
manifest Lorentz or gauge invariance, but is attractive in that it doesn’t involve any unphys- 
ical degrees of freedom. The two non-zero components of A“ are transverse to the photon 
momentum and correspond to the two polarization directions. In the radiation gauge, the 
free-field expression for the vector potential is 


A(x) = I me 5 [E (P, A)a(p, A)e T? + E* (p, Ajdi (P, Ajete], (2.113) 


where E, = |p|, a and a‘ are bosonic annihilation and creation operators satisfying 
[a(p, A), at (p", A] = (27 )22E, (PPO, (2.114) 
à = 1,2 refers to the two possible polarization states, and €(p, A) is the photon polarization 
in the direction of the photon electric field. The radiation gauge condition V -A = 0 implies 
P- E(P, A) =9, (2.115) 

and the normalization is 
é*(p, A) -E A) = av. (2.116) 
For a linear polarization basis, the € (p, A) are real and can be chosen 

€(p,1) x €(p,2) = p. (2.117) 


Thus, one can choose €(p,1) = pı where p, is an arbitrary unit vector orthogonal to p, 
and €(p,2) = p x 61; e.g., for p in the z direction, €(p,1) = (1,0,0) and €(p,2) = (0,1,0). 
One can also use the basis of left- and right-handed circular polarization vectors 

> py _ ED, 1) Fie (p,2) 

EL,R ny ar 2.118 

@) z (2.118) 

which correspond to photon helicities (spin measured with respect to the momentum direc- 
tion) of +1, respectively. It is useful to introduce the polarization four-vectors e” (p, A) = 
(0, €(p, A)), where 


p-ep, A) =0, (BA) - (9, X) = —yy. (2.119) 


Thus, the free-field expression for A” is obtained from (2.113) by replacing €(p,A) —> 
e (p, A). 

From (2.113) one sees that the amplitude for a physical process is of the form e” (p, A)-M,, 
for an initial photon, or e”*(p, A) - M, for a final photon, where the amplitude M, depends 
on p, and the other momenta and spins in the process. (The rules for constructing M, 
will be described in later sections.) Often, the initial photon beam is unpolarized, or a final 
photon polarization is not measured. In that case, one averages (sums) over the initial (final) 
polarization directions, yielding a rate proportional to 
|| S H(p, A)” (B, A)| MMi (2.120) 

à=1,2 
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or ¿M ,M* = M*M, for a final photon). This is easily evaluated using 
20 Bey H 


Hal apiy 
D H(p, Ae E, A) = -g +P ri Br, (2.121) 


where p, = (Ep, —p) so that p, p = 2F2. One can use gauge invariance to show that 
pM, = 0, so that the second term in (2.121) does not contribute,!! i.e., (2.120) is just 
-M,M**. 

The Feynman propagator for the free photon field is 


d'k _, a! 
OITA” (a), A” (x’)]|0) = ig” e *(@—-*') D p(k) + gauge terms, (2.122) 
(2m)4 
where j 


The additional terms in (2.122) are gauge-dependent (e.g., for the radiation gauge they in- 
volve the generalization of the second term in (2.121) to off-shell momenta). A full treatment 
of gauge issues is beyond the scope of this book (but see the discussion of the Rẹ gauges in 
Chapter 4). However, the additional terms do not contribute to physical processes by gauge 
invariance, so in practice one can simply assign a factor ig!” Dp(k) = —ig” /(k? + ie) for 
an internal photon line carrying momentum k in a Feynman diagram. 


2.5.1 Massive Neutral Vector Field 
The results for the free electromagnetic field are easily generalized to the case of a massive 
Hermitian vector (spin-1) boson V”. The free-field Lagrangian density is 
Lo = -aua + meV Ve, with GH” = ð" V” — oV". (2.124) 
It should be emphasized that Lo is not gauge invariant under 
V” > V" + 0” 8, (2.125) 


because of the mass term, and does not lead to a renormalizable theory. However, it is useful 
to display the relevant formulae useful for phenomenological tree-level calculations. 
The free field is given by 


Pps | 
H = a Hy = —ip-x pk (= ti +ip-« 
V"(a) I @rJ2E, 2 [et (p, A)a(p, A)eT P? + et* (p, Ajat (p, A)eT?™] , (2.126) 


with Ep = \/p? + m?. A massive vector has three polarization states. These include the 
two transverse states €(p,1) and e(p,2) (or eL, R(P)) similar to the massless case, and a 
third longitudinal (helicity-0) state 


5 Ean 
e(P, 3) = — (IP; 0,0, Bp), (2.127) 


11The analogous situation for QCD and other non-abelian theories is trickier; one can only drop the 
second term if ghost contributions are included (e.g., Peskin and Schroeder, 1995). Another subtlety will be 
encounted in the example of Compton scattering in Section 2.6. 
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where we have taken p along the z direction for definiteness. Note that the magnitudes of 
the components of ¢(p,3) become large at high energy, with «“(p,3) ~ p/m. The three 
polarization vectors satisfy (2.119), and the polarization sum is 


3 ‘ids 
S* (5, A)e* (B,d) = —gt” + 28 (2.128) 


m? ` 
à=1 
The momentum space propagator associated with an internal line in a Feynman diagram is 


_ 7b k” k” 
g + n 


a (2.129) 


iD (k) =i | 


The second term in Dy does not drop out of calculations and leads to bad ultraviolent 
(large k) behavior and thus to non-renormalizability. One also sees that the limit m —> 0 
is not smooth. 

A renormalizable gauge invariant theory of massive spin-1 fields, in which the mass 
is obtained by the Higgs mechanism!? rather than as an elementary term in £, will be 
discussed in Chapter 4. In that case, (2.129) will still correspond to the unitary gauge, in 
which the physical degrees of freedom are manifest, but there are other gauges in which the 
m — 0 limit is smooth. 


2.6 ELECTROMAGNETIC INTERACTION OF CHARGED PIONS 


We are now ready to combine the results of Sections 2.4 and 2.5. Consider the Lagrangian 
density 


£(d, A) = (8,8)'0"5 — mEt — TF FH” — Vilo, G) + Leal, A) (2.130) 


for a complex scalar field ¢ and the electromagnetic field A“. The scalar self-interaction Vr 
is defined in (2.89), and £4, describes the electromagnetic interaction. Its form is dictated 
by the requirement of gauge invariance under (2.112). The only way to do this (without 
introducing non-renormalizable interactions) is the minimal electromagnetic substitution, 
familiar from classical and quantum mechanics. One replaces 


ph — p“ — qA" <> ið” — qA" (2.131) 


in the Lagrangian density in (2.88), where q = e > 0 is the charge of the t*, and identifies 
the additional terms with £44. £ will then be invariant under the generalized gauge (or 
local) transformation 


1 
AM > A' = A" — 0" 
E eo ee) (2.132) 


dog = tbla) eg = Pg 


i.e., £(¢, A) = L(¢’, A’). Equation (2.132) generalizes the global symmetry of Section 2.4.1, 
i.e., 6 > etf o where 8 = constant. 


121t is also possible to construct a U(1) gauge invariant theory for a massive vector without the Higgs 
or other spontaneous symmetry breaking mechanism by the Stiickelberg mechanism (Stueckelberg, 1938; 
Cianfrani and Lecian, 2007). 
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Using the minimal substitution, 


L = [(, + igA,)o]'(O" + igA")d — m2 gto - iue — Vi(¢,¢") 
| m 
(8,19, )6t (2.133) 


1 
= [Duo] [D"¢] — moto- FF FM — Vid, 9"), 
where 
D” = Ə! + iqA” (2.134) 


is known as the gauge covariant derivative. 
The gauge invariance of (2.133) is obvious except for the first term. Under (2.132), 


D" > D" ¢' = [a Lig AH i2a"a(2)| (e #a)g) = = eP) pg 
D + (Dud)! p e, 


That is, the shift in A” compensates for the change in 6” ¢ due to the derivative of 3, leaving 
L gauge invariant. Thus, gauge invariance for the charged scalar requires the existence of a 
spin- 1 field, dictates the form of the interaction with A“, forbids an elementary mass term 
(ie., im 2 A,A" is not gauge invariant), and restricts the form of the scalar self-interactions. 
It tras out that the theory is then renormalizable. 

From (2.133) one can read off iL, a: 


Loa = —G(O"b) Ay + (GIOND)A, + iG? A, A" old 
= qD" G) A, + ig? A,A" to, 


(2.135) 


(2.136) 


where iot Bro is the Noether current for the free complex scalar field!’ in (2.108). The 
Feynman vertex rules can be found from (2.136), inserting the free-field expressions for ¢ 
and ¢! in (2.93) and recalling that a(p’) and b(p) are the annihilation operators for 7+ and 
m`, respectively. For this purpose, it is convenient to explicitly display 


CePA ; . 
go) =) apoE, [-ipta(p)e"?* + ipb (pe? | (2.137) 


ep i i 
"g(x = I Toan [ip b(p e P? + ipta (p)et?*] , 
where p = (Ep, P) with Ep = yP? +m? as usual. 

The vertices are displayed xi Figure 2.10 for q = e. The three-point vertices include a 
contribution —iep", for each 7+ and a +iep_ for each 7. These are always the physical 
momenta, whether initial or final. They can both be written as —iep! +, where p,+ = +p,+ 
is the momentum in the direction of the arrow. There is also a four-point (seagull) vertex 
2ie*g"”, which is needed for gauge invariance. The Lorentz indices are contracted with 
€u(k, À) for an initial photon of momentum k, with e% (k, A) for a final photon, and with 
iguvDr(k) = —igur/(k* + ie) for an internal virtuàl photon line. Each internal pion has 
a propagator iAp(k) = i/(k? — m? + ie), and there is an integral f d*k/(2m)* over each 
unconstrained internal momentum. 


13Interaction terms usually do not modify the form of the Noether currents. Gauge interactions of complex 
scalars are an exception because the interaction terms involve derivatives. For example, the conserved 
+> 
Noether current for £ in (2.133) is JM = iġt Ho — 2qġt A“, which appears in the field equation for A in 
the familiar Maxwell form ð F” = qJ”. 
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Figure 2.10 Vertices involving charged pions and one or two photons. In the one- 
photon diagrams the initial pions enter from the bottom and the final leave from 
the top. Antiparticle (m7) vertices are obtained from particle ones by twisting the 
lines and replacing p by p = —p, where p is the physical four-momentum and p 
follows the direction of the arrow. The wavy lines represent photons. 


nK and rr Scattering 


As a first example, consider the electromagnetic scattering r7 Kt — m~ Kt, where K* is 
a complex field with the same electromagnetic couplings (but different mass) as 7+. They 
are introduced here to avoid identical particle effects. We ignore strong interactions of other 
non-electromagnetic couplings. There is a single tree-level diagram, as shown in Figure 2.11. 
The Feynman rules lead to the transition amplitude 


Mri = ie(pı + p3)” | | (—ie)(p2 + pa)” 


(pı + p3): (p2 + pa) _ er ee 
(pi — ps)? -i ( t ): 


(2.138) 


= —4ria 


where a = e?/4r is the fine structure constant. The Mandelstam variables s, t, and u are 
defined in (2.31) and are related to the CM scattering angle in (2.40). From (2.57) the 
differential cross section in the CM is 


do — 1 | jaro s/2p? +1 + cos0 í 
dcos@ 32ns' 7° 2s 1 —cos0 


, Tae 3+cos0\? 
Vs>mx 28 \1l—cosd/) ’ 


(2.139) 
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Figure 2.11 Feynman diagrams for r` K+ —> m~ KT (top), 7 
dle), and tt n — a+ x7 (bottom). 


where p ~ (s — m%)/2,/s since mx >> mr. The total cross section is 


+1 


dcosð. (2.140) 


For m+ a* — ntr” there are two Feynman diagrams due to the different ways of 
associating the fields in ila with the identical external particles, as shown in Figure 2.11. 
The transition amplitude is 


Mji = ie? E + p3): (p2 + pa) , (pi + pa): (p2 + ps) 
to _ 2 : _ 2 
(p3 pı) (pa pı) (2.141) 
: s—u s—t 
= AtTia | + | . 
t u 


The differential cross section is again given by (2.57), but in this case there is an extra 
factor 4 in the total cross section because the final particles are identical. At high energies, 
Vs > Mr, one finds 


dcos 0. (2.142) 


do Qa? G2) i. do 
_ a 


dcos@ s sin? 0 ~ 1 deos 


14The integral is divergent for forward scattering due to the massless photon propagator pole, i.e., the long 
range Coulomb force. This divergence is already present for classical Coulomb scattering, and disappears 
for realistic situations in which screening by other charges or the finite resolution of the detector are taken 
into account. Similar comments apply to mtrt for both forward and backward scattering and to m7 1. 
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There are similarly two diagrams for rt n7 — ntr”, yielding 


Me = ie? —(pi + ps): (p2+pa) _ (Pı — P2) > (p3 — pa) 
j (p3 — pı)? (p1 + p2)? 
i (2.143) 
>. |[—-s+u u-t 3 + cos“ 0 
= Aria + = 4ria | —————_ |. 
t s 1 — cos 0 


In this case, however, the final particles are not identical. Note that the amplitude for 7+ m7 
can be obtained from that for ++ 7+ by the formal substitutions py > —pz and py > —pa, 
an example of crossing symmetry. That is, the amplitude for an outgoing 7* of momentum 
p is the same as that for an incoming m~ with momentum —p, as is apparent from the 
Feynman rules. Of course, the physical values of p are different for the two cases, since p? 
(—p°) must be positive for the first (second) one. 


Pion Compton Scattering 


Now consider pion Compton scattering, y(k1,\1) rt (p1) —> Y(k2,A2) + (po), with s = 
(ki +p1)?, t = (k2 —k1)?, and u = (pọ — k1 )?. The amplitude corresponding to the diagrams 
in Figure 2.12 is 


Mpfi = Cy Ely —ie(k2 + 2p2)"- — me (—ie)(kı + 2p)” 


(2.144) 


: 1 n v n v 
ie(2 pı — ke) =z ie)(2p2 — kı)” + ie? gt”| , 


H 
u — 


where €2 = elka, A2) and «, = elki, 1). It is straightforward to show that My; vanishes 


(ke; Aa) T (P) (Èa, Aa) n+ (P) Ilka, A2) © * (Ba) 
a” ke F Pa 
¢ 
H P —k , “ 
lkiı+p + jee + 4 
b. r Y, 
` ra ` 
~y(kı, A1) at (pi) at (pi) (Ra, Ai) (ki; Ai) wt (pi) 


Figure 2.12 Diagrams for pion Compton scattering. The third (seagull) diagram is 
required by gauge invariance. 


if €1, is replaced by ky, or e3, by k2,, which is a manifestation of gauge invariance. This 
would not occur without the 2ie?g”. 

Recall that our radiation gauge formalism is not manifestly Lorentz invariant. It is 
convenient to choose A° = 0 and V- A = 0 to hold in the laboratory frame, in which 
pı = (m, 0). Then, pı < €; = pı « €5 = 0, as well as kı - €; = k2 -€5 = 0, so that My; takes 
the extremely simple form 

My, = 8Triac3 &. (2.145) 


From (2.65) and (2.46), 


do a? kN? ; 
dcos 6, Z a (2) Er (2.146) 
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where 6; is the laboratory angle of the final photon and 


ko _ 1 
kı 1+ "(1—cos6;)) 


(2.147) 


For an unpolarized initial photon and unobserved final polarization one must average 
(sum) over the polarization states A1 (A2), yielding 


da 1 do 
= : 2.148 
dcos 0, 3 2 dcos 6, ( ) 
1,A2 


This can be evaluated using (2.121) for both the A; and Az sums,!° with 


kr = kı(1,0,0,1), ký = k2(1,sin 0z, 0, cos Oz) (2.149) 
ki, = kı(1,0,0,—1), kb. = k2(1, — sin 0z, 0, — cos 0z), l 
yielding 
1 , 1 
5 So la à? = J4 + cos? 0z). (2.150) 
Arr2 
In this case, however, it is simpler to evaluate the sum using the explicit forms 
1) = (0,1,0,0), 1) = (0, cos 0z, 0, — sin 0 
a(l)=( ) e(1)=( L inz) (2.151) 
€ (2) = €2(2) = (0,0, 1,0). 


for the transverse polarization vectors. 


2.7 THE DIRAC FIELD 


The Dirac field q(x), where a = 1---4 is the spinor index, describes a four-component 
spin-$ particle, i.e., y annihilates the two possible spin states for a particle and creates the 
two spin states for the antiparticle. In the absence of interactions, the Lagrangian density 
is 


Lo = U(2)a (iP — mM) a4 V(t) 3 = U(x) (i 9 — m) y(x). (2.152) 
The sum over a and ĝ is written in matrix notation in the second form, in which w is a 
four-component column vector; the Dirac adjoint 


P(x) = p(x) (2.153) 
is a four-component row vector; and 
p= ato = "On, (2.154) 
Ox 
where 7", u =0---3, are the 4 x 4 Dirac matrices, defined by 


{7,7} = 2g”. (2.155) 


(There is an implicit 4x 4 identity matrix on the right side of (2.155) and after m in (2.152).) 
The 7" must also satisfy 


(E = qty? = Yp, (2.156) 


15Since we have already evaluated Mf; in a specific gauge, we cannot drop the second term on the right 
of (2.121). 
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so that Lo = L}. It is useful to define 


VESPE, = z0 |. (2.157) 


° enters for spin and chirality projections, for coupling fermions to pseudoscalars, and for 
axial vector currents in the weak interactions. From (2.155), 7° = 75t, 7°y# = —7"7°, and 
(7°)? = I. o!” is useful, e.g., in the description of electric and magnetic dipole moments. 
An arbitrary 4 x 4 matrix can be written as a linear combination of I,7°,y",y“y°, and 
a!” For example, o#”7° is given in Problem 2.9. 


2.7.1 The Free Dirac Field 
From (2.152), one obtains the free-field (Dirac) equation 


(i9 —m) p(x) = 0. (2.158) 
The solution to (2.158) is 
37 2 , 
V(x) = i OE XO [u(B, s) a(p, s)e™™? + v(p, s)b' (p, se"? *) , (2.159) 


s=1 
where at (p, s) and bt (p, s) are the creation operators for e~ and e+ states, respectively,'® 


a'(p, s)|0) = [e7 (P, 8)), b? (P, s)|0) = |e* (P, s)). (2.160) 


p refers to the physical momentum for both |e? (p, s)), and s runs over the two independent 
spin states. a and b and their adjoints satisfy the anticommutation rules in (2.10)—(2.12), 
e.g., {al 5), at (B', 8')} = (2m)32B, P- Pies. 

In (2.159), u(p,s) and v(p,s) are four-component Dirac spinors (complex column vec- 
tors). From (2.158) they are the solutions to the momentum space Dirac equation, i.e., 


(# — mul”, s) = (pu 3” — m)u(p, s) = 0 


(Ø + m\w(p, s) = 0. (PRE 
Taking the adjoint of (2.161) and using (2.156), 
Kg- mju = ut (put — m) = ul (puyi — (9°) m) (2.162) 
= ti(p— m)7° =0, 
where ū = uly° and ọ = v'7° are the Dirac adjoints. Thus, 
EE mi) = a(6, E <0. (2.163) 


Before proceeding to the electrodynamics of fermions, let us consider some properties of 
the Dirac matrices and spinors. 


16By convention, we are taking Y = pe- to be the e~ field. The et field Y° = w,+ is of the same form 
as (2.159) except a and b are reversed. Charge conjugation and space reflection will be discussed in more 
detail in Section 2.10. 
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2.7.2 Dirac Matrices and Spinors 
Explicit Forms for the Dirac Matrices 


The Dirac matrices are defined by (2.155) and (2.156). For most calculations one does not 
need their explicit form, but it is occasionally useful to have one. For example, the Pauli- 
Dirac representation is useful for studying the non-relativistic limit of an interaction, while 
the chiral representation is useful at high energy and for Weyl or Majorana fields, encounted 
in neutrino physics and supersymmetry. In the Pauli-Dirac representation 


0o I 0 i 0 gt 5 0 I 
YZ -7 Y=- 0 YFZ 0 
Oi; 0 a gi = P ok 0 

7 =o 0 Seiko gk}? 


where I is the 2 x 2 identity matrix and o’ are the Pauli matrices in (1.17). Similarly, in 
the chiral representation 


(ion © o0 Y= 9 ~\ o Z 


(2.164) 


i 2.165 
=i (7 © oie [7 0 ve 
~~ 0 gi — Cijk 0 ok : 
It is sometimes convenient to rewrite the chiral representation matrices as 
0 ot 
ü 
P= & a ; (2.166) 
where 

ot = (I,ġ) o" = (I, —0) = op. (2.167) 


Traces and Products of Dirac Matrices 


Most calculations can be carried out without using the specific forms of the Dirac matrices 
by using various trace identities, where the trace is Tr A = )>,, Aaa for any square matrix 
A. Note that Tr A = Tr A? = (Tr At)* and Tr(AB) = Tr(BA). For any representation, 
(2.155) and (2.157) imply 


Trl =4, Try’ = Try’ = 0. (2.168) 
Define the 4 x 4 matrix 
d= yan (2.169) 
for an arbitrary four-vector a,,. Then, from (2.155) 
d¥=—-Pd+2a-bI > dg=da’l. (2.170) 
One has immediately that 
Tr(¢d Y) =4a-b. (2.171) 


Other useful trace identities include (see, e.g., Bjorken and Drell, 1964), 
Tr(d¥¢d)=4(a-b c-d+a-d b-c—a-c b-d) 
Tr( d Y) =0 (2.172) 
Tr (7° á y ¢ d) = 4ie”” PT Apbvepde, 
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where e#”?? is the totally antisymmetric tensor with €o123 = +1 and €°!23 = —1. Also, 


Tr(@i-++ dn) = Tr (7? di dn) =0 (for n odd) 
Tr (fi +++ gn) = a1 + a2Tr (d3 -+ dn) — a1 -a3Tr (d2 da't dn) 


2.173 
-- + a1 ` QnTr (do--+ dn—1) (for n even) ( ) 
Tr (di d2: dn) = Tr (dn: do dr). 
Related useful identities are 
H = 4I L H — —2 
a Yu AY : d (2.174) 
Yu APF =4a-bI, yA PEY =—2 gY d. 
Finally, we record the identities 
Tr [Ya av ae + dy) ] Tr [> dy’ d (1 + ry) ] 
_ f 64a-cb-d for + (2.175) 
64a-db-c for — °’ 


where ÀA = +1. These results can be derived using the identities in (2.172) and Table 1.2, 
as will be shown in detail in Section 7.2.1. They are extremely useful for calculating four- 
fermion polarization effects, as well as for weak interaction decay and scattering processes. 


Spinor Normalization and Projections 


The Dirac spinors satisfy the normalization and orthogonality relations 


u(p, s)u(p, aa —0(P, s)u(p, 8") = 2m bee" 
ul (p, s)u(p, ae boa! (2.176) 
alp, s)o(B, 8’) = ul (8, s)o(—9, s") = OH, s)u(D, s’) = vt (P, s)u(—#, s') = 0, 


Sup, s) up,s)=p+m, 5 _ o(p,s) (8, s) =% -— m, (2.177) 


sS S 


which are useful in summing over spin orientations in physical rates, are 4 x 4 matrices. The 
form of (2.177) follows from the Dirac equation, while the normalization follows by taking 
the trace and using the first equation in (2.176). (The second equation (2.176) for s = s’ 
then follows by right-multiplying (2.177) by y° and then taking the trace.) 

The projections 


ud, 8) UG, s) = (p+ m) Gass 


v(p, s) 0B, s) = (~—m) ci r) 


are useful when one does not want to sum over spins. Here, s“ is the spin four-vector. In 
the particle rest frame, p = (m, 0), it is just a unit vector 


(2.178) 


s = (0,8) (2.179) 
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in the spin direction, so that s? = —1 and p-s = 0. Boosting to an arbitrary frame in which 
p= BE, = ym, : 

= (78-8, y+ 81), (2.180) 
where êj = $- BB and §,; = § — j are, respectively, the components of § parallel and 
perpendicular to 6. Thus, s = (0, 8) for &- Ê =0, while s = s4 = = +7(2, Â) for &- Ê = +1. The 


latter are known as the positive and negative helicity states, respectively, or alternatively 
as right- and left-handed states. The projections in (2.178) simplify greatly for the helicity 
states in the relativistic limit: 


5 PE Ad 
w+ (=A) g ST pe Pas 
AT ae (2.181) 
(p m ( 3 >) T p= PLR B, 


where the chiral projection operators Pry, will be discussed below. 


The Propagator 


From (2.159) and the spinor sums (2.177), one obtains the Feynman propagator for the free 
Dirac field!” 


T, - dtk —ik-(x—2! 
(OT Wa), PENO = 4 | Sree) Se), (2.182) 
where the momentum space propagator Sp (k), which is a 4 x 4 matrix, is 
1 
Sr(®) = meee (2.183) 


K—-m+ie k?—m?+ie 


The last equality follows from (2.170). 


Explicit Spinor Forms 


Explicit forms for the Dirac spinors are occasionally useful, e.g., for considering non- 
relativistic limits. In the Pauli-Dirac representation 


u(p,s) = yEp+ m — ; (2.184) 
Ep+m Ps 


where ¢, is a two-component Pauli spinor describing the spin orientation. Thus, 


a=(4)) =(1) (2.185) 


describe spin orientations in the +2 directions, respectively. ø+, defined by 


hoz = a i pot = 450s, (2.186) 


17 The time-ordered product of two fermion fields is defined as in (2.27) except there is a minus sign before 
the second term. 
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describe positive (negative) helicity states. Similarly, the v spinors are 


op 
u(p, s) = /Ep +m (efor) 


Xs 


where 
a - 2x 
Xs = ~io g}. 


ws (E) 


Thus, 


39 


(2.187) 


(2.188) 


(2.189) 


represent spins in the +2 directions. The positive (negative) helicity spinors, which continue 
to mean that the physical spin is parallel (antiparallel) to the physical momentum, satisfy 


ss ato al 
BO Ee = FKE 


(2.190) 


The counter-intuitive results in (2.189) and (2.190) are considered in Problem 2.13 and in 
the discussion of charge conjugation in Section 2.10. Explicit forms for the helicity spinors 


are given in Table 2.1, using a phase convention for which 


io°¢ = Foy, iP?’ X4 = FXF 


(2.191) 


TABLE 2.1 Explicit forms and properties of the helicity spinors“ corresponding to spherical 


angles (0, p) for p. 


Rotation: R(0, p) = e7173 2 e125 ets $ 
H=- a) H=) 

52 — sin be~’? 
H= = ( 503 ) 6-0) =x) s ) 

in j bemir 

wA =A, e) Aaa h PE ) 
os (p) = te**? os (P) xa (P) = +e? x= (—p) 
X+(Ê) = —io* + (P)" = p- (P) x- (Ê) = —io?°¢p- (P)* = -+ (Ê) 
Orthonormality: pl (P) (P) = dij = x} (Px (D) 
Completeness: Digi (Pet (p) = ya Xi (p)x! (6) =I 


“The spinors are constructed using the rotation s+ (p) = R(@,y~)s+4(Z), where s = ¢ or x. The spherical 
angles of —p are (m — 6, t + p), so that —Z = (7,7). The orthonormality and completeness relations also 


apply to the fixed spin axis basis. 
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The explicit u and v spinors in the chiral representation are 


[Bp tm [0 a) t - (x7) 
2 (1+ z z4) ds VP Os 


egy Eam { C- a) \ _ ( vivax. 
2 A Gta) of VP 


where o” and g” are defined in (2.167). The second form is easily verified using (1.19) on 
page 4. In the helicity basis, these are 


wo VER ag) a yE A) e 


Ax (p) = 14 5 i z (2.194) 


The chiral representation is especially useful in the case of massless or relativistic fermions, 
for which the upper (lower) two components of the positive (negative) helicity u spinors 
vanish, and oppositely for v, 


where 


daam aa 
m—>0 P+ m—+0 0 
j (2.195) 
PE X+ s 
Chiral Fields 
For a fermion field Y, one can define left (L)- and right (R)-chiral projections 
1-7 145 
Yr = Pry = Yy, Yr = Pry = 7 Y, (2.196) 


where PÊ R = =P, BR) Pr, Pr = PrP, = 0, Pİ p= = P; „R, and Pr + Pr = I. wr and wR can 
be viewed as independent degrees of freedom, with Y = Yz + Yr. For a massless fermion 
the L- and R-chiral components correspond to particles with negaue and positive helicity, 
respectively, i.e., Yz and Wr annihilate fermions with helicity h = + 5. For antifermions it 
is just the reverse, Yr, and Wp create antifermion states with h = ai For mass m Æ 0 the 
chiral states of energy E associated with Yz, r have admixtures of O(m/ E) of the “wrong” 
helicity (Problem 2.15). The free-field Lagrangian density in (2.152) can be rewritten in 
terms of the chiral projections as 


L= Ņri ð Yr + Yri ð Yr- m (YpLYr + RYL), (2.197) 
where Y% L,R are defined!’ by 


pr = (YL) = Y Pry = YPr 
Yr = (vr) = yt Pry = YPr. 


(2.198) 


18 Some authors use the notation wy or (Yg) rather than wy to emphasize that the Dirac adjoint operation 
acts on wy rather than on w. 
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The chiral projections %z,r are also known as Weyl spinors or Weyl two-component 
fields. They can be described as above in four-component notation, i.e., as two-dimensional 
projections of four-component fields Y, but it is often convenient to discard the superfluous 
components and work in two-component notation. As the name “chiral” suggests, this is 
most conveniently displayed in the chiral representation, for which 


P, = ie ae Pr = 2 l (2.199) 


pa eal ), Pyh a) (2.200) 


where Vy pr are the Weyl two-component fields. 
The Lagrangian density (2.152) for the free Dirac field can be written in terms of the 
Wey] fields as 


Thus 


Lo = Y} iad Vr + Vhia Vr -m (VYR + UUs), (2.201) 


where the four-vectors o” and g” are the 2 x 2 matrices defined in (2.167). The Dirac mass 
term couples the L and R components, while the kinetic energy terms are diagonal. The 
free-field Dirac equation becomes 


io" 0, Yr — MYR = 0, io ô, VR — my = 0. (2.202) 


Above, we introduced the chiral fields as projections of a four-component Dirac field. 
Alternatively, one can simply define chiral fields as those satisfying Yr = Prwz, or Yr = 
Prwp, i.e., not necessarily as projections of another field w, and in fact this was done for 
the L-chiral neutrinos in the original formulation of the standard model. Equivalently, Weyl 
fields Yz or Vp can be introduced independently of each other. For example, a single Weyl 
L field with Ly = Wi iad, L would describe a massless negative helicity particle and a 
positive helicity antiparticle. 

One can also define the chiral projections of the u and v spinors, 


uL R(P, 8) = Pr, RU(P, s), UL,R(D, s) = PL, RU(P, 8). (2.203) 


ial e Bsa (2.204) 


then uz, r satisfy the Dirac equation 


If one writes 


-5u = (EI +8- p)ur = mu 
dade uae DL z (2.205) 
p:cour = (Epl -2 -pP)ur = muz. 


The chiral components of v = (vz vp)? satisfy similar equations, only with m —> —m. It 
follows easily that the solutions for u and v are given by (2.192). 

For m — 0 the equations for uz, r decouple. The u and v spinors of definite helicity 
given in (2.195) then coincide with the left- and right-chiral projections, 


Pru(p, +) = u(p, +), Piu(p, —) = up, —) 


Pro(p, —) = v(p,—), Pop, +) = o(p, +), (2.206) 
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showing the flip between chirality and helicity for the v spinors, consistent with (2.181). 
The free-field expressions for Yz, R or Vz r are especially simple in this case. From (2.159) 
and (2.195) 


3> 
ween = | ap VE O) ABF Exa Ee e], (2.207) 


which is similar to the free Dirac field except there is no sum on spins. 
The Weyl two-component formalism is further developed in Section 2.11 and in Chapters 
9 and 10. 


Bilinear Forms 


Consider the bilinear form w2Mwj,, where w1,2 are any two Dirac u or v spinors. They 
may even correspond to particles with different masses, which is relevant, e.g., for weak 
interaction transitions. M is an arbitrary 4 x 4 matrix. Then, 


(W2Mw ,)* = Muz, (2.208) 
where M = ° Mta? is the Dirac adjoint of M. One finds 
M= M for M= 1,4", y, 0%” 
M = —M for M = 77, 0o” 9” (2.209) 
MıMə = Mz Mı > fı fo-+: dn =fin- +> flo fi. 


There is an equivalent relation, 


(2M)! = pı My», (2.210) 
for two Dirac fields, which may be the same or different. Then, for example, 


5 lūz Mu? = 5 üs Muıŭı Muz 


51,52 51,52 


— 5 Uza [M(øØ + mi)M | og U2B 
s2 (2.211) 


M(pi + m)M (>> rata) 


= Tr [M (pr +m1)M (po + ma)| ’ 


which allow one to express a physical rate in terms of a trace. (This is sometimes referred 
to as the Casimir trick.) 
For chiral spinors 


= Tr 


ULR= (wr.r)’ 7? = wt Pr ry = wily PR L = WPr.v. (2.212) 
Therefore, for any two spinors w; and we, 
Wi, wr; = 0 = WR WR (2.213) 
for T = I, %5,0”, or c#”7°, while 
TL war = 0 = TRl wor (2.214) 


for T = 4! or 77°. Equivalent relations for the chiral fields Yz R were used in (2.197). 
Thus, scalar, pseudoscalar, and tensor transitions reverse the chirality between an initial 
and final fermion, while vector and axial vector transitions maintain it. 
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The Fierz Identities 


The Fierz identities are 
(pir ywr) (dsr bar 


(iry por) (vara 
(pir ber) (Psr Yapar 
) 


(dirver) (Wsrbar 


—np (pary bar) (Usr Iver) 
= -nr (pır par) (V3RYuV2R) 
= Aj (pirar) (v3rv2R) (2.215) 


)= 
) 
) 
ja T (dinar) (Warner) + * (piro war) (Y3ROw 2L), 


where Yig and pjr are anticommuting chiral fields and nr = —1. There are analogous 
relations for u and v spinors, but with nr = +1, e.g., 


(iry wL) (sL Yawa) = — (iry wa) (LY wL). (2.216) 


The Fierz identities are easily derived by expressing the 4 x 4 matrices such as 2r 3, in 
terms of the complete set 1,45, y”, 74°, and o””. A related useful identity is 


(piro per) (W3reuvar) = 0. (2.217) 


The Fierz identities are frequently very useful in computations, and are often used in con- 
junction with those for charge conjugation (Section 2.10). 


2.8 QED FOR ELECTRONS AND POSITRONS 


Just as for pions, one can obtain the Lagrangian density for quantum electrodynamics 
(QED), i.e., for the gauge theory of electrons and positrons interacting with photons, by 
combining the free-field Lagrangians in (2.111) and (2.152) and applying the minimal elec- 
tromagnetic substitution, p” > p“ — qA} <> ið” — qA”, where q = —e < 0 for the electron 
field. Thus, 


= Bla) (9 + eA- m) Yla) — FFF” (2.218) 


where 
D =3"Dp, Dy = Op — teAp- (2.219) 


L£ is clearly invariant under the gauge transformation 
AM — AP Tarpa), y > eb B@legy = eE, (2.220) 
since D,Y > cP) Dw, in analogy with (2.135). In the last form of £ 
JO (x) = —p(z)y" y(x) (2.221) 
is the (conserved) electromagnetic current, i.e., the Noether current. 


In Feynman diagrams electrons and positrons are represented by solid lines, with an 
arrow indicating the direction of flow of negative electric charge (or opposite the flow of 
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positive charge). The interaction term in (2.218) implies three-point vertices involving one 
photon and two charged fermions, with a factor iey,. Charge conservation implies that 
there is always one arrow entering (i.e., entering e~ or exiting e+) and one leaving (exiting 
e7 or entering et) the vertex. A final e~ has a factor u(py, sf), while an initial e~ has a 
factor u(p;, si). The corresponding factors for a final or initial e* are v(py, sf) and (pi, si), 
respectively. These are always the physical momenta and spin. An internal fermion line 
carrying momentum k corresponds to the propagator iSp(k) = ip. The spinor 
indices always arrange themselves so that each fermion line running through the diagram 
is a bilinear form starting with w or y and ending with u or v. These rules are illustrated 
in Figure 2.13. As in Section 2.6 there is also a factor ef, (€„) for a final (initial) photon, 
a propagator ig,,Dr(k) = —igu,/(k? + ie) for an internal virtual photon, and an integral 
J d'k/(27)* over each unconstrained internal momentum. 

Since e* are fermions, the overall sign of a diagram depends on the ordering of particles in 
the states. The overall sign is rarely needed, but there may be crucial relative signs between 
two diagrams due to the anti-commutation rules for the creation and annihilation operators. 
In particular, there is a relative minus sign between diagrams involving the exchange of two 
external lines, and a factor of —1 for a closed fermion loop, as illustrated in Figure 2.14. 
More complicated sign ambiguities are best resolved by going directly to the expression for 
the transition amplitude. 


u(Ps, ss) v(Pr ss) 
\ N 
teyu ` vy teyu y Y 
f ld 
U(Pi, si) (Pi, Si) 
Y =n = 
alp- s-) VBa 84) 
À ieyu q eu 
<N ; 
u(p_,s_) (P+, S+) i 


Figure 2.13 Vertices involving the interactions of e+ with a photon. The initial 
fermions enter from the bottom and the final leave from the top. + refer to a 
positron or electron, respectively. Note the crossing symmetry, i.e., up to an overall 
sign the amplitude for an initial positron can be obtained from that for a final elec- 
tron except ü — U, always using the physical momentum and spin, and similarly 
for the relation of a final positron and initial electron. 
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Figure 2.14 Left and center: relative minus signs between diagrams involving ex- 
changed fermion lines, with u; = u(i, si). Right: a closed loop diagram, with an 
extra factor of —1. 


er Scattering 


First consider e~ (k1) t+ (p,) 3 e” (kz) + (p>). The lowest order diagram, shown in Figure 
2.15, yields the transition amplitude 


= > igh 
Mpi = teti(ke, 82) y,u(k1, $1) (x s) [—ie(p2 + pi)al, (2.222) 


where the Feynman rules for the pion vertex are taken from Figure 2.10. Using (2.57), the 


_,> > us(D3, S v 7(Pa, S4) 
a(R», s2) ip), “eae one 
te pitpe 
= +r 
u(x, 61) m (P) u (Pi, 81) TR TET 


Figure 2.15 Left: lowest order diagram for e` (ky) n+ (51) > e` (kz) r+ (pp). Right: 
diagram for e~ (p1) et (P2) > f(Ps) f (Pa), where f # e*. 


unpolarized differential cross section in the CM is 


da 1 do 1 1 2 1 — 2 
= = Mri = ——|M,;;,|*, 2.22 
dcos@ 2 dcos@ 3278 |; Z fil 3275. sil ( 2) 
81,52 $1,852 


2 


where s = (pı + kı)” and we have averaged (summed) over sı (s2). The squared matrix 
element is evaluated using (2.209), (2.211) and the trace identities in (2.171) and (2.172), 


let 


|M]? = zg + pı) (p2 T Di)v 5 (tay"u1) (u1 U2) 
81,52 
: e H v 
= 5 p (P2 + Pı)u(p2 + pı)y Tr [7" (Ki + Me)” (Ka + me)] (2.224) 
4 


e 
= 277z (P2 + pi)u(p2 + pi) [kt k3 + RY RE — gt” (kı - ka — m2)], 
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where u; = u(ki, s;) and t = (kı — k2)?. This is easily evaluated using the kinematic rela- 
tions in Section 2.3.4. For example, ignoring the pion and electron masses, (2.224) becomes 
167?.a?([(s — u)/t]? — 1). 


eet Annihilation 


Now, consider e~ (p1) e+ (p2) — f(B3) f (Ba), where f 4 e* is a fermion with electric charge 
Q ye. There is a single tree-level diagram, shown in Figure 2.15, which implies 


; e —igh? a 
eee ree ey ( ) Gemeni 


—iQre? 
S 


Üz Yp V4 Uy" un, (2.225) 


where s = (pı + p2)”; uz and v4 are spinors for a mass mf; and vz and uj; correspond to 
mass Me. Averaging (summing) over s1,2 ($3,4), the unpolarized cross section is 


da 1 pr 


= Ml? 2.22 
dcos@ 327s pi [Mji], (2.226) 
where 
= 1 2 ee 
IMP => D Me? = Te Dala- my v(t + mp) 
Si, 1=1---4 
x Tr [y"(Bi + me) 7” (p2 — Me)] 
4Q2et 
= = [PapPsv + Pav P3y — Juv (Ps ` pa + m*)| 
x [pips + pips — gM” (pı -p2 + me)] (2.227) 
8Q¥e4 2 
mad g2 [Pi - Pa p2 * P3 + pi ps p2: pa + mM} pi: po] , 


where me is neglected in the last line. In that limit, 


Pi2 = (E,0,0, =E), p34 = (E, +p sin 0, 0, +p; cos 0), (2.228) 
in the CM, where 
2 
E = p; = oe Pf = es = BrE. (2.229) 
Then, 
pı ` pa = p2 pa = E? (1 + By cos8) 
Pi: p3 = p2 pa = E? (1 — Bp cos 8) (2.230) 
pı p2 = 2E°?, m4 = (1 — BF)E”, 
implying 


do _ T Qła?’ By 
dcos @ 2s 
Integrating over cos 0, the total cross section is 


AnQa" Bs 3- Be f An Qa? 
Bpol : 


(2 — 87 + BF cos? 0) . (2.231) 


(2.232) 


g= 


3s 2 3s 
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Bhabha Scattering 


e- 3 et 4 = 
(p oo Pra (pa) e (ps) , e* (pa) 


Y \ 
T$ pitpe = ae 


Figure 2.16 Feynman diagrams for e` (p1) e*(p2) > e` (p3) e* (pa). The relative mi- 
nus sign between the two diagrams is independent of state conventions. 


The diagrams for Bhabha (e~et — e~ et) scattering are shown in Figure 2.16. The 
amplitude is 
Uzypvavey'ur —- Ug Yur vay"'va 
(pı + p2)? (pı — p3)? 


M =ie? | (2.233) 


Ignoring the e~ mass, the unpolarized cross section is then dē /d cos 8 = |Mp:|?/3275, where 


- 1 
Ma? = = D M} 


spins 


= Z (Yu Bay pa) Tr (I Piy” Bo) | Tr (Yu Piw Ps) Tr (Y! Bar” p2) 


4 s2 ! t2 


Tr (Yu Bay Boy" Bry’ ps) Tr (Yu Bi Poy” pay” A) 
st st ` 


(2.234) 


The traces are easily evaluated using the identities in Section 2.7.2, yielding 


= } } 
dcos 8 s s2 t2 st 


da mo? [t +u? 2u? J (2.235) 


Electron Compton Scattering 


The Feynman diagrams for e~ Compton scattering y(k,)e7 (p1) > y(k2)e7 (p2) are the same 
as the first two diagrams in Figure 2.12 except that the pions are replaced by electrons. The 
corresponding amplitude is 


M = 6&,,€1,(ie)t2 | ur. (2.236) 


1 i 
u V aw r] 
Y hoko me ? ho ho me 


It is straightforward to show that the differential laboratory cross section averaged 
(summed) over the initial (final) electron spins is given by the Klein-Nishina formula 


da ta? (ko ko kı 
= Ale ves = 2) 2.2 
ae (2) E + is + 4(€1 - €5) | (2.237) 
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k2/kı has the same form as in pion Compton scattering, Equation (2.147), except m — me. 
If one also averages and sums over the initial and final photon spins, one obtains (using 


(2.150)) 
da mo? (ky\* [kz ky 2 
= H in“ ðL], 2.238 
dcos, m2 (2) E p ae) 
which yields the classical Thomson cross section ojo = 87a7/3m? in the limit 


ko ~ kı <K Me. 


2.9 SPIN EFFECTS AND SPINOR CALCULATIONS 


We have so far mainly focused on spin-averaged calculations. However, many experiments 
involve polarized initial particles or measure the final polarization, e.g., by their decay dis- 
tributions. The calculation of fermion polarization effects can always be carried out using 
the standard trace techniques, which yield the absolute squares of amplitudes, provided one 
uses the spin projections in (2.178) and (2.181). However, it is often simpler to calculate 
the amplitudes directly using the explicit forms for the y matrices and Dirac spinors (e.g., 
Hagiwara and Zeppenfeld, 1986). Further simplifications are achieved because of space re- 
flection invariance, which relates different spin amplitudes. (Even for the weak interactions, 
which violate reflection invariance, different amplitudes may be related up to a known over- 
all coefficient.) However, such calculations are carried out in a specific Lorentz frame and 
are therefore not manifestly invariant. Both techniques can be tedious for non-zero masses, 
so in this section we will illustrate calculations assuming that the masses are negligible. An 
example of a calculation for a massive fermion is given in Problem 2.26. 

The spin-averaged amplitude-squared for e~ (p1) e+ (p2) —> f(p3) f (Ba), given in (2.227), 
reduces to 


|M]? = Qe" (1 + cos? 8) (2.239) 
for Me = my = 0. Now consider the amplitude for definite spins in the helicity basis, 
which we denote by M(hgha,hih2), where h; = +% is the i*” particle helicity. For massless 
particles, by (2.214) the only nonzero amplitudes are for hy = —hg and h3 = —hyg, since 
vector and axial vector interactions do not reverse chirality. We will see in Section 2.10 that 


reflection invariance implies that there are only two independent amplitudes, and that 


M (+-,+-)=M(-+,-+),  M(-+,+-)=M(4+-,-+4), (2.240) 


where we have simplified the notation by only writing the sign of h;. Momentarily keeping 
the masses, the absolute square for arbitrary spins is 


2,4 5 5 
me =A m {an -mp (HEE) n (he tmp) (AT) 


x Tr [at (a + me) (=) 1” (fo — me) = 2) , 


where s; is the spin vector given in (2.180). Taking the masses to zero and using (2.181), 
this reduces to 


(2.241) 


|M (+—,+—-) |? = |M (-+,-+) ? 


Qet ; 
= OP (Yu Paty PPr) Te (Q iy” PPr) (2.242) 
Qet 
= 7 (pi - pa p2: p3) = Qee* (1+ cos 0)”, 
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where the calculation of the traces and contraction of the Lorentz indices was carried out 
using (2.175). Similarly, 


Q? et 


IM (+=, =+) |? = |M (=+, +=) |? = ŽE (p1 - ps po pa) = QFet (1 — cos0)?. (2.243) 


Equations (2.242) and (2.243) reproduce the spin-averaged result in (2.239). 
Now let us repeat the calculation using the explicit gamma matrices and spinors in the 
chiral representation, given in (2.165) and (2.192), noting that 


yP =T, e er N E Ha (2.244) 
Then [using s = (V2E)‘], 
M (+-,+-) = aL t+ (3)ypv_ (4) v- (2)y u4 (1) (2.245) 


= iQ se? [6+(3)'x-(4) x-(2)'6+(1) — 9+8) ox- (4) x-2"), 


and 
M (—+, +—) = + iQre [6-3] x+ (4) x- (2144+01) 
; ang, (2.246) 
+ 6-(3)totxs (4) xt (2)o*ġ4 (1)] . 
From (2.228) and Table 2.1 the helicity spinors are 
a(g) ~@=($), #@=n= ( a 
- (2.247) 


The first terms in (2.245) and (2.246) vanish. The second terms are most easily evaluated 
using the SU (2) Fierz identities in Problem 1.1, 


PB XA XAD AA) = 28B A xD lA — AAA xD), (2.248) 


yielding 


0 
M (+—,+—) = M (-4+, —+) = 2iQ pe? cos” — = iQ re? (1 + cos 0 2.249) 
f 5 f 


0 
M (-+,+—-) = M (+—, —+) = —2iQ pe? sin? = —iQ se? (1 — cos). 


2.10 THE DISCRETE SYMMETRIES P, C, CP, T, AND CPT 


Space reflection and charge conjugation are symmetries of the strong and electromagnetic 
interactions, but are violated by the weak interactions because of their chiral nature. The 
product CP is also violated in the weak sector, though more feebly, due ultimately to 
complex phases in the Yukawa interactions between the fermion and Higgs fields. Such 
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phases may be large, but their effects are small because they require mixing between all 
three fermion families to be observable. Time reversal is also violated, as is expected because 
of the CPT theorem, which states that any local, Lorentz-invariant, unitary field theory 
must be invariant under the product CPT (Streater and Wightman, 2000). In this section 
we introduce the basic formalism for the discrete symmetries. Physical consequences and 
tests will be described in Chapters 7 and 8. More detailed descriptions of the discrete 
symmetries may be found in, e.g., (Bjorken and Drell, 1965; Gasiorowicz, 1966; Weinberg, 
1995; Sozzi, 2008). 


Space Reflection 


Under space reflection!® (P) a classical vector changes sign, e.g., 
T — -T 0 — —p. 2.250 
Bethy PP ( ) 


However, an axial vector such as orbital angular momentum L=2zx p is left invariant, and 
it is reasonable to define spin and total angular momentum to have the same property 


Ī— J. (2.251) 
P 


Both vectors and axial vectors transform as vectors under rotations. Scalars are rotational 
scalars that do not change sign under P. Examples are t, Æ, and |p|?. Pseudoscalars, such 
as J- p, do change sign. In particular, helicity, h = 5. p, is a pseudoscalar and reverses 
under P. Vector and axial?” four-vectors transform, respectively, as 


u u = 
V4 Vy, AM Ay, (2.252) 


so that x = (t,@) and p = (E, p) are vectors. The fermion spin vector s,, in (2.180) is an 


axial four-vector. The angular momentum J is part of a second rank antisymmetric tensor, 
L= Feije LI", where 


L.” = op’ — r” p” = Diri: (2.253) 
A single particle state is assumed to transform similarly, 
P|p's) = np|— Ps),  P\ph)=np|—p — h), (2.254) 


where (in this section) s represents spin with respect to a fixed axis and h represents 
helicity.?! The phase np is the intrinsic parity, which depends on the particle type. One 
must have np = +1 to ensure P? = J. Invariance of the action in (2.18) requires invariance 
of the Lagrangian, and therefore that 


PL (t, Z) P =L (t,—g). (2.255) 
For the example of the free complex scalar, 


= (3p) (3 p) — m7 44, (2.256) 


19More precisely, an inversion or point reflection. 

20We use the symbol A, both for gauge fields and for axial vectors. The meaning should always be clear 
from the context. 

21Tn the helicity form, there may be additional (P, h)-dependent phases, depending on the phase conven- 
tions for the helicity states. 
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this can be accomplished for 
Po(t,)P-! = npdlt, -2). (2.257) 


The mass term is obviously invariant. For the kinetic term, 


A EET Ge 


ôx” 


ð 
plz’) = near olr) = np" 4(z’), (2.258) 
Oxi, 
where a’ = (t,—@). Thus, |0,,6(x)|? — |0/,6(2')|?. Equation (2.257) follows from the free- 
field expression in (2.93) provided 
Pal (B)P™ = npa (-p), PH (B)P = npb'(—p), (2.259) 


where at (p) and bt (p) are the particle and antiparticle creation operators. The transfor- 
mation of a Hermitian scalar field is the same as in (2.257) and (for al (p)) in (2.259). 
For the free Dirac field, with 


L=4 (t, Z) lip- m] y (t,2), (2.260) 
we must choose Py (t, Z) P7} in such a way that 
P(t, Z) y(t, Z) P! = b(t, -#) v(t, -Z) (2.261) 
and 
Po (t, Z) yY (t, Z) P = v(t, -#) wu (t, —2). (2.262) 


The lowering of the index on y“ compensates for ô, = ô”, analogous to (2.258), i.e 
w(x) Zyl) = y(x) J'y(x'). These conditions can be satisfied if 


Py (t, Z) PT} = 7°v (t, -@) (2.263) 
=> Pb (t2) P~ = (Yy t,-2))' P =y -2)' Pr =F -2) 1", 
using y°7"7° = Y, from (2.156). (We have taken npy = +1 for simplicity.) For chiral fields, 
Pir, r(t, Z) P7 = Pr, (t, -2 ) 
Por r(t, Z) P7 = (Yr, (t,—-2)) 7 = bre (t, -2) 4. R 


Using the expression (2.159) for the free Dirac field and the relations”? 


Pu (6,8) =u(-B,8), Pv (P, s) = —v (—P, 8), (2.265) 


which follow from the explicit spinor forms in Section 2.7.2, we see that (2.263) is equivalent 
to 
Pat (p,s)P7} =al(—-p,s), Pb (p,s)P~ = —bt (—p, s). (2.266) 


Thus, a fermion and its antifermion must have opposite intrinsic parity. 
An arbitrary fermion bilinear form transforms as 


Popa (t, Z) Uy (t, Z) PTE = npanpoa (t, -T ) PTY (t, -7 ). (2.267) 


The values of T, = y°I'y° are listed in Table 2.2. In particular, T = 1,95, 9#, and yy? 
p Y 


?2For the phase conventions in Table 2.1, the transformations in the helicity basis are Pu (P, p+ (p)) = 
+e+t?u (—p, p4 (—P)) and 7°v (P, x+ (P)) = FeF*¥v (—p, xy (—p)), where o is the azimuthal angle of p. 
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TABLE 2.2 Transformation of the complete set of Dirac matrices and of their related 
chiral forms under Hermitian conjugation (the Dirac adjoint I’) in (2.210), space reflection 
(Tp) in (2.267), charge conjugation (Te) in (2.296), and time reversal (T+) in (2.325). 


r T T, r ie 


c 
ae ee Pry Sl TE*T-! 

Scalar 1 1 1 1 1 

Pseudoscalar 4 —75 —75 x5 5 

Vector y” y. Yp —yl Yy 

Axial vector gtg’ gtg’ =y yty’ wy 

Tensor ov ot Ops —gtv -Tw 

Pseudotensor® =a”? =o” 75 —owy old —Owy? 

SFP PLR Pr, Pry Pink Pir 

VFA PLR "PLR Was Pay PLR 
Oo Pre Of Prt On PRL —o""PrR = —OwPrR 


“The pseudotensor o#”y° is not independent (Problem 2.9), but is included for completeness. 


transform as a scalar, pseudoscalar, vector, and axial vector, respectively, for 7panpp = 1. 

An important example involves the left-chiral (V — A) currents pary” paz, which trans- 
form as 

Pipar oL P = Gar Yubor: (2.268) 

We will see in Chapter 7 that the weak charged current (WCC) interactions involve only 
V — A, so (2.268) implies that the WCC is not P invariant. 

As will be discussed in Chapters 3 and 5, the Yukawa interaction between a neutral pion 
n° and nucleons can be phenomenologically described by 


Lan = igr (Pp — nyn) r°, (2.269) 


where g, is a real coupling, and where p and n, respectively, represent the proton and 
neutron fields. Since the nucleon term is a pseudoscalar, reflection invariance requires that 
the 7? must be a pseudoscalar, i.e., 77 = —1. Of course, this was originally ascertained by 
experiment (see, e.g., Gasiorowicz, 1966). The pseudoscalar nature of the 7~ was determined 
by the observation of the reaction n7 D — nn, where the deuteron D has J? = 1+ and the 
mT was established to be in an S-wave. The neutrons had to be in an odd-parity 3P} state, 
since it is the only antisymmetric J = 1 state available. Similarly, the 7° was shown to have 
odd parity by the angular distribution in 7° — ete~ ete” (Abouzaid et al., 2008). The i 
in (2.269) is required by Hermiticity. 

The interaction terms in (2.133) and (2.218) indicate that the electromagnetic field AG 
couples to vector currents, where in this section, we use the symbol AG to distinguish it from 
an axial vector. Reflection invariance of the electromagnetic interactions therefore requires 
that A% transforms as a vector 


PAR (t, Z )P7! = Ag, (t,-2), (2.270) 


which is consistent with the classical expectation E> -Ë and Ë > +B. Using the free-field 
results in Section 2.5, this is equivalent to 


Pa! (P, An) PT = al (-p,—An), (2.271) 
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where the polarization label A, represents the photon helicity. This can be seen from the 
free-field expression (2.113), using the relation 


e! (D, Xn) = €u(—P, —An), (2.272) 
which follows from (2.117) and (2.118) under the convention 
(p, A) = (1) eu (p, A), A= 1,2 (2.273) 


for the linear polarization vectors. All of these results also apply to massive vectors, with 
three helicity states. 

The explicit transformations of the fields in (2.257), (2.263), and (2.270) were justified 
for free fields. However, the same transformation laws will continue to hold in the presence of 
the strong and electromagnetic interactions, or any others that respect reflection invariance. 
This is apparent in the interaction picture (see, e.g., Peskin and Schroeder, 1995, and 
Appendix B), in which the interacting fields can be formally expressed in terms of the non- 
interacting ones and the interaction Hamiltonian Hy, and using [P, Hz] = 0. To illustrate the 
importance of this, consider the matrix elements of the vector and axial currents, Varn 
and Darn Vo, respectively, where Ya, represent strongly interacting fields such as nucleon 
or quark fields, and a can be the same as or different from b. 


7 2.274 
(a(Ba, Sa) Pa Vu V bold, 80) = GPa, Sa Tj u(Db, 80). bral 


The fields are evaluated at x = 0 (otherwise, by translation invariance, there would be a 
factor e'(P«—P+)® on the right). In the absence of strong or electromagnetic interactions, one 
would have rY = Yu, and A = 77°. However, such corrections can yield more complicated 
matrix elements. From Lorentz invariance, eae can involve linear combinations of Yp, uvg”, 
dus Yu Y5, Suv’ y?, and quy?, where q = pa — pp. Each of these can be multiplied by an 
arbitrary form factor that can depend on the invariant qg?. Other four-vectors, such as those 
involving Pa, + Po, are not independent (see Problem 2.10). However, reflection invariance 
of the strong and electromagnetic interactions restricts the possibilities. Assuming the same 
intrinsic parities for a and b, 


(a(Da, 8a) |Yaubo|b(Ds, se) = (a(Ba, Sa)| P7 Paubo P P|b(Dn, sv)) 
= (a(—Ba, 8a) bay Ww bo|b(—Bs, 80)) 
= U(—Pa, Sa) (q Ju(—P, so) 
= U(Pa, Sa) PTY *(q')yu(pe, Sb); (2.275) 


where we have used (2.265). The notation [Y“(q’) indicates that it is evaluated using 
Gu = Pap — Phu = 9”. Reflection invariance therefore requires 


V — ~OpPVp 0 
Ty (a) = PT” (g). (2.276) 


Comparison with Table 2.2 shows that rY can only contain Yu, uvg”, and qu. Similarly, 
T4 can only contain 7,7, opvq” y, and q,7°. (Higher-order terms in the weak interactions, 
which violate reflection invariance, could induce the wrong terms. However, one can always 
treat such effects explicitly in perturbation theory, using P invariance for calculating the 
matrix elements.) 

As another application, consider M(a;(p;,h:)), the matrix element of a scattering or 
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decay process involving k external particles with particle types a;, momenta p;, and helicities 
hi, i = 1---k, some of which are in the initial and some in the final state. If the relevant 
interactions are reflection invariant, then it follows from the interaction picture expression 
that the amplitude is the same as the amplitude for the reflection-reversed process, 


M (a;(pi, hi)) = nM (a;(—pi, —ha)), (2.277) 


in which 7 = +1 is associated with the intrinsic parities and the spinor phase conven- 
tions (see Footnote 22). This result is especially useful in certain special cases for which 
M(a;(—p;, —hi)) = M(a;(p;, —hi)). For example, this follows from ordinary rotational in- 
variance for 2 + 2 scattering in the center of mass frame,?’ or for 1 — 2 or 1 > 3 decays of 
a spinless particle in its rest frame, because the momenta lie in a plane and can be reversed 
by a rotation by m around an axis perpendicular to the plane. In such cases, 


M (a; (pi, hi)) = nM (a; (pi, —ha)), (2.278) 


which can be very useful in simplifying calculations. 


Charge Conjugation 


Charge conjugation changes particles into antiparticles, without affecting their momenta or 
spin, e.g., 

Cla(p, s)) = ncala“(p, s)), (2.279) 
where a® is the antiparticle to a and cq is a phase factor. (It is basically a convention as 
to which is called the particle and which the antiparticle.) For a complex scalar field, this 
implies 

CHO = ncp, CAC = nb go, (2.280) 


since ¢! is the field for the antiparticle to ¢. Equation (2.280) implies 
Cal (p)C = nco (P), CU (P)O = nega" (P) (2.281) 


for the creation operators in (2.93). One can always make a phase transformation ¢! = 
V=nce? so that the new field has nog = +1. For a Hermitian scalar, one must have 
nco = +1, 


CoC-1=+¢, Cal(p)C71 = +al(p). (2.282) 


If one expresses a complex scalar ¢ with nce = +1 in terms of two Hermitian scalars ġ1,2, 
as in (2.91), then nog, = —nNcé, = £1. Results similar to (2.280)—(2.282) hold for the 
transformation of a vector field A,. For QED the gauge field couples to a C-odd vector 
current, so one must choose Ag, —> —Ag,. (C is not conserved for the chiral theories 
that will be introduced in Chapter 4, which involve both vector and axial currents with 
opposite C transformations. It is nevertheless useful to define the transformations of the 
gauge fields in this way.) The intrinsic C phases can be determined experimentally for (C- 
invariant) strong and electromagnetic processes. For example, 7° decays to two photons, so 
Nor = Noy = +1. 
For QED, the electron state is transformed into a positron, 


Cle™ (P, s)) = ncele* (P,s)), (2.283) 


?3Helicity is not Lorentz invariant in general. 
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and similarly for other fermions. One requires 


CoC = nyt, CHOT! = nowb* = ncy (6°) 70, (2.284) 


where w is the e~ (or fermion) field, and 7° is the e* (or antifermion) field. Only the relative 
phases of different fermion fields in bilinears are usually relevant;?* in the following we will 
set ncy = 1 for simplicity. From (2.159), the free fermion fields are given by 


2 


3 > 
we) = fase DL UBs) ase” + olp, ONG, sperm 


27)32Ep 4 
a (2.285) 
wo) = S EP D [uP s) D, s) + op, shal (p, set] 
(27)32E, = L , ? ’ , , 
so one must choose 
Cal (p, s)C—+ = b (P, s), Cb (p, s)C—1 = at (P, s). (2.286) 


From (2.285) it is apparent that ọ%° is related to the Hermitian conjugate Yt. The precise 
relation, which is extremely useful even if a theory is not charge conjugation invariant, is 


ye =C? =C (pt) =CTyT, ge = yte! (2.287) 


(in indices, YS = Cogbg = (CT) op ph). C isa 4x4 Dirac matrix, which can be determined 
by the requirement that the free Lagrangian density in (2.152) is C-invariant, i.e., 


CLoC~* = 4° (x) (i 9 — m) (x) = Lo. (2.288) 
Equation (2.288) is satisfied if and only if C satisfies 
Cyl = 7, (2.289) 


where one must use the anticommuting nature of the fermion field, and the fields are consid- 
ered to be normal-ordered so that c-numbers can be ignored.”° Consistency between (2.285) 
and (2.287) (or between (2.161) and (2.163)) further requires that the u and v spinors are 
related by 

vB 8) = Caps)” = (C7) ulBs)*", (P, s) = CoH, 5)", (2.290) 
which implies 
Tet, a(p,s) = —u(p, s)7Ch. (2.291) 
These relations hold both for the case of a fixed spin axis or for helicity spinors. The 
explicit form of C depends on the representation used for the y matrices and is seldom 
needed. However, C takes the simple form 


u(p, s) = —v(P, 8) 


C = =C! = -CÝ = -CT = 41°77 (2.292) 


in the Pauli-Dirac (+i7?7°) and chiral (—i7y?7°) representations. The off-diagonal nature 
of o? accounts for the extra minus sign in (2.190). 


24Majorana masses, which we will encounter in connection with neutrinos in Chapter 9 and supersymmetry 
in Section 10.2, are an important exception. 

25From (2.287) and (2.289) we have that 7 and w° have opposite intrinsic parities under space reflection, 
consistent with the comment following (2.266) and ensuring that C and P commute. 
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For arbitrary fermion fields Ya » (with the same intrinsic phases), 


Cpa WOT = bayt pi = -pa C7 Cpa 


= z = (2.293) 
= tpa be = bey be = — (bar yo)! 
In particular, the electromagnetic current 
Jo = —Ve- Ype- (2.294) 
in QED transforms as 
CJAC! = pet y Wet = He- 4” Ye- = =p (2.295) 


so that the interaction —eAg, JQ in (2.218) is invariant for Ag, > —Ag,. An arbitrary 
fermion bilinear transforms as 


Chal CT! = WET YE = WT cha, (2.296) 
where e 
Te = (CITC) Sere, (2.297) 


The I, are listed in Table 2.2. The spinor identity corresponding to (2.296) differs by a sign, 
i.e., 
Walwy = -r we, (2.298) 


where each w can be either a u or v spinor, and 
u°(p, 8) = u(p, s), v°(p, 8) = u(p, s) (2.299) 


is the charge conjugate spinor. 
The chiral fermion fields transform as 


ChrC* = 4} = Pry, CYRO! = Yh = Prys, (2.300) 


where Y? (Yh) is the field that annihilates a left (right)-chiral antiparticle or creates a right 
(left)-chiral particle. From (2.287) and (2.300) 


pi = Pub =C Va = Pry =C 
pi =-YRC, R= —vic™, 


where, e.g., Y = Myr] = T (Yr)'T. Thus, Yẹ ~ Yh up to Dirac indices. Similar 
results hold for the transformations (2.290) and (2.291) of the chiral spinors. The flip of 
chirality in (2.301) is at first confusing. It is associated with the fact that C does not 
change the spin or helicity, but the relation between chirality and helicity is reversed for 
antiparticles, as can be seen in (2.195). For example, in the massless limit Y$ and wh both 
create positive helicity particles or annihilate negative helicity antiparticles. Note that y” 
and y“7° transform with opposite signs, so that a left-chiral current is mapped onto a 
right-chiral one, 


(2.301) 


Char vor = par Vir = -Vor "par. (2.302) 

Thus, from (2.268) and (2.302) the (V — A) charged current weak interactions violate both 
space reflection and charge conjugation invariance. Similarly, 
ChartorC* = virtir = Hor Var 


au i (2.303) 
Charo’ bor * = piro” WER = piro” par. 
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The relations (2.296), (2.298), and (2.302) are frequently used in conjunction with the Fierz 
identities in (2.215). For example, combining (2.215) and (2.302), 


(parior) (der Yapar) = — (VELI PEL) (ber Yapar) 
a (pery war) (der Yubin) f 


The transformation laws for boson and fermion fields continue to hold in the presence 
of the strong and electromagnetic interactions, which are C-invariant. Even including the 
weak interactions, which violate C, the same relations hold for the unperturbed fields when 
the weak effects are treated perturbatively, analogous to the discussion of reflection non- 
invariance. The spinor identities in (2.290) and (2.291) are valid independent of whether C 
is violated. 

The implications of C-invariance and the related concept of G-parity for strong transi- 
tions and for matrix elements of operators will be discussed in Section 3.2.5 and Appendix G. 


(2.304) 


CP Transformations 


It is straightforward to write Lagrangians that violate P or C, but more difficult to find ones 
that violate CP invariance, at least for a small number of fields, due to the combination 
of Hermiticity, Lorentz invariance, and the fact that the overall phases of physical fields or 
states are not observable in quantum mechanics. In particular, one can always choose the 
phases of the fields so that any given term in a Lagrangian is manifestly CP invariant. CP 
violation can therefore only emerge when two or more terms clash and cannot be invariant 
simultaneously. Note that C and P commute. 

To make this more explicit, the P and C transformations of spin-0 fields ¢ and generic 
spin-1 fields W” defined in the previous sections always imply 


(x) —> nig (a), W(x) ae my Wila’), (2.305) 


where a’ = (t,—Z), ng and nw are products of C and P phases, and the f are omitted for 
Hermitian fields. Similarly, 


Yla) > PH) = =C a’) 


* 0, 1 * tT 1 (2.306) 
WL,R(x) rai Ny Y PRL) = =C Yr R(E ). 


Thus, CP maps w~z,r onto its own Hermitian conjugate. In contrast, P and C each maps 
a chiral fermion field onto a different one, YL, R P WR L, WLR F vhr The elementary 


fermion bilinear forms Ypa(£) y» (x) for the T’s in Table 2.2 all satisfy 
T * vA p, * T j 
dala) Pvo(x) > Maole T Yala") = nrav [Val Ela") (2.307) 


where nžap = Manner, with nr = +1 for I = 1 or 7°, and yr = —1 for T = 74, 944°, oh”, 
and o””47*. It is understood that Lorentz indices in I are lowered in the second and third 
expressions. Equation (2.307) can be easily verified from the transformations in (2.210), 
(2.267), and (2.296), along with Table 2.2. The upshot of (2.305) and (2.307) is that any 
Poincaré invariant term £;(x) with field content such as 


lotel’, 3 gt OndsW", atope, Paupo W”, (2.308) 


as well as any mass or kinetic energy term, is mapped onto its Hermitian conjugate evaluated 
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at x’ up to an overall phase £;(x) a me Ll (a! ). The phase n; depends on the intrinsic CP 


phases 7¢,w,) and on any complex coefficients (which are not complex-conjugated under 
CP, but are in Li). The intrinsic phases can always be chosen (or the fields redefined) so 
that n; = 1 for any given i, and therefore the Hermitian combination £;(x) + LÌ (£) TP 


LÌ (x") + Lila’) (or just Lile) a L(x’) if it is Hermitian). CP invariance of L results if 


this can be done for all of the terms simultaneously. 
Let us illustrate by an example involving two complex scalars ¢, and ġa, with 


L= Ý (lOudr|? — mler?) — V (ba, os): (2.309) 
r=a,b 
For the special case 
V = gav0bads + Gh or: (2.310) 
one has 
V => gamma, OLON + Gana baby (2.311) 


The model is CP invariant even if gabb = |gapp|e’* is complex because we have the freedom 
to choose the CP phases na p. For example, L(x) — L(x’) for the choice Na = exp(2ia) and 
m = 1. This can be cast in a simpler form by a field redefinition, in which one rewrites £ 
in terms of a new field ĝa = Qa exp(ta@), so that 


V = |gav0 (bag? + diol’) (2.312) 


with the other terms unchanged in form. This is invariant for Ĥa = me = 1 (for which the 
transformations of ¢, and ĝa are consistent). On the other hand, the potential 


V= JarbPabi, + Jaavb Pb + deine + h.c. (2.313) 


with complex coefficients is not CP invariant in general because the two C’P phases na, are 
not sufficient to make all three terms invariant (except for special cases such as gaaa = 0 or 
all phases = 0). Equivalently, one can perform two phase redefinitions of the fields, but that 
is not usually enough to make three coefficients real. This example illustrates the origin of 
CP violation in the standard model, in which there are not enough quark field redefinitions 
to remove all of the phases from both their mass terms and the charged current weak 
couplings simultaneously. 


Time Reversal and CPT 


Under a time reversal (T) transformation, classical trajectories are replaced by their time- 
reversed ones, 


Ht) > a-t), PO p- IH) > -F(-0). (2.314) 


The electromagnetic current and gauge potential satisfy 


Joule) > IG"), — Agu() > AB"), (2.315) 


where x” = (—t,#), so that E(x) > E(x") and B(x) > —B(x"’). Time reversal is more 
complicated in quantum mechanics. In analogy with (2.314) one defines a time reversal 
operator T so that 

Tx,T | = ti, Tp,T 1 = —p;, (2.316) 


Review of Perturbative Field Theory 59 


where x; and p; are the position and momentum operators. However, consistency with the 
canonical commutation rules [x;, pj] = iĝi; (or with the Schrödinger equation) requires that 
T is antiunitary, i.e., Te T71 = c* for any c-number c. This extra complex conjugation signif- 
icantly complicates many calculations. The antiunitarity is also required for the consistency 
of the canonical commutation relations in field theory. A consequence of antiunitarity is 
that 

(a|O|b) = (Ta|TOT~*|Tb)*, (2.317) 


where © is an operator and |Tb) and (Ta| are time reversed states. For example, 


T|a(p, 8)) = |Ta(p, s)) = |a(—p, —8)), (2.318) 


for a single particle state with spin orientation s measured with respect to a fixed axis. For 
multiparticle states, T also interchanges initial (in) states with final (out) states. 
Invariance of the action under time reversal requires TL(x)T~ + = L(x"). This is ensured 
for free spin-0 fields for 
T4(x)T-? = nrod(2"), (2.319) 


where ¢ can be either Hermitian or complex, and nrọ is a phase. (In the complex case 
o = (¢1 + id2)/V2 this requires np, = —nr2 for the Hermitian components ¢; and ¢9.) 
For an electrically charged complex field, (2.319) implies that (2.315) is satisfied for the 
contribution 


: = : =, 
JE (a) = igo! (x) 0" Ox) = —igd! (x") Iela") = JG, (2") (2.320) 
since 04 = —O/. Similarly, 
TW*(2)T—* = nrwW,(2") (2.321) 


for an arbitrary Hermitian or complex spin-1 field. 
For a Dirac field, invariance of the free-field action or the requirement (2.315) for Jọ 
implies 
Tyla) T} = nry Tyla"), (2.322) 


where 7 is a Dirac matrix satisfying 
TYPT =T = h. (2.323) 


In the Pauli-Dirac and chiral representations 


T=T =T =-T* a1: (2.324) 
A fermion bilinear therefore transforms as 
Tala) Eya (£) TT! = pale”) Eip (2”), RSTI, (2.325) 


assuming the same T-phases for Ya». The values of T, are listed in Table 2.2. Implications 
of T-invariance for matrix elements will be discussed in Appendix G. The u and v spinors 
transform as 


Tulp, s) = u(—p, —s)*e**, To(p, s) = v(—p, —s)*e"*, (2.326) 
where a and £ are phases that depend on s. For the conventions in (2.184) and (2.192), 
exp(ta@) = exp(i3) = Fi for s in the +2 direction. 


The CPT theorem, i.e., (CPT)L(x)(CPT)~! = £L(—2) for a Hermitian, local, Poincaré- 
invariant £, can be demonstrated heuristically by an extension of the discussion following 


60 The Standard Model and Beyond 


(2.308). There it was argued that CP maps the field part of each term onto its Hermitian 
conjugate, up intrinsic CP phase factors, but does not complex conjugate the coefficient. 
CPT does conjugate the coefficients, and is such that the product of intrinsic phase factors 
can always be chosen to be +1. To see this, by combining the transformations for T, P, and 
C, we find that 


Plr) —— feble), — Vale) => Ôv Vpl) 


CPT CPT Ga 
ð ð i 
— Tif? d(—x)tF = 
ue) cpr | le? oa ae Oxt (=x!) 


where the upper (lower) sign is for the Pauli-Dirac (chiral) representation, and = nrnpné. 
Analogous to (2.307), the fermion bilinear forms transform as 


7 Pe i 
Pale) Yole) —> frat [Palale], (2.328) 
where rab = Až ahr, with Ar = 1 for T = 1,95, 0%”, or ot” 7°, and fr = —1 for T = 7" or 
7°. Therefore, if one chooses the phases so that 74 = +1 for all ¢, fy = —1 for all V, and 
fy = —1 for all y every Poincaré invariant term in L(x) will be mapped onto its Hermitian 


conjugate evaluated at —z, leaving the action invariant. (Any universal value for 7, would 
suffice for a fermion number conserving theory, but Ĥy = real is required for manifest CPT 
invariance in the presence of Majorana mass terms.) More rigorous derivations of the CPT 
theorem may be found in, e.g., (Streater and Wightman, 2000; Weinberg, 1995). 

The CPT theorem implies that the mass, intrinsic properties, and total lifetime of a par- 
ticle and its antiparticle must be equal (see, e.g., Sozzi, 2008). However, CPT does allow 
partial rate asymmetries (Okubo, 1958) if CP is violated, i.e., the decays rates T (a — 6; 2) 
for a decay of a into bı or b2 can differ from the corresponding antiparticle decay rates 
I'(a° — bf 2), provided that the sums of the partial rates are the same. This is impor- 
tant for many models of baryogenesis (Chapter 10). Experimental searches for CPT and 
Lorentz invariance violation are reviewed in (Kostelecky and Russell, 2011; Liberati, 2013; 
Patrignani, 2016). 


2.11 TWO-COMPONENT NOTATION AND INDEPENDENT FIELDS 


Weyl two-component fields (or Weyl spinors) were briefly introduced in (2.200) and the 
subsequent discussion. The description of the discrete symmetries for fermions is somewhat 
simpler when reexpressed in the two-component language. First, consider the case of a four- 
component Dirac field ~, written in terms of two-component Weyl L and R fields as in 


(uy 
(2.200), i.e., Y = ( T 


—io? 0 0 —io? o 0 
C= ( 0 in) , cy = o 0 ) , T=- ( 0 I ’ (2.329) 


the transformations under P and C in (2.263) and (2.287) become 


). Using the explicit chiral forms 


Ur rz) > Vaca’),  Vrr(z) > Vir), (2.330) 


where x’ = (t,—7 ) and we have set npy = ncy = +1. The charge conjugate Wey] fields are 
defined as 
R =i Ve, WE = —io? Yh, (2.331) 
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where * is a shorthand for fT, i.e., 
Uh = YEr = (Wz,n)™ (2.332) 


is a column vector with components that are the adjoints of those of Vz pr. Equations (2.330) 
and (2.331) require that WF p(x) = -ÜR (7), i.e., the fields and their charge conjugates 


have the opposite intrinsic parity. Both P and C map one Weyl field onto another, e.g., Yz 
is mapped unto Vz or Yh. On the other hand, CP maps a Wey] field onto its own adjoint: 


Prz) —> V(r) = +io? VF (2’) 
ad (2.333) 
r(t) > Vi (a") = -io? UR(2'), 


with the io? acting like a raising/lowering operator on the helicity indices (cf. Equa- 
tion (2.191)). Thus, the CP transformation always exists, even in a theory involving a 
single Weyl field (and independent of whether the Lagrangian is invariant). On the other 
hand, P and C are only defined when one has both Yz and Vp. 
Under time reversal, 
Wr r(x) = —0° Y; R(x"), (2.334) 


where z” = (—t, 7 ) and we have used (2.322) with nry = 1. Combining (2.333) and (2.334) 


Uz r(£) TP Fitr R(—2)", (2.335) 


consistent with (2.327). 
Fermion mass and kinetic energy terms are expressed in two-component notation in 
(2.201). Some important fermion bilinear forms become 


pay” po = Ui oy, + tl o Vor 
pay vo = -vt a" Wor + Ul, o War 
Pao = Vİ Wor + Vip Wor 
Payo = Ut Vor — Up Vor (2.336) 
T= 
zhao Wo = Yİ 5 Der + Ul ps” Voy 


1- SV v 
zva” he = vi, SY’ Wor — vi ps” Wo, 


where o” and g” are defined in (2.167), and 
i 


s= 
4 


(cta” — oa"), giv =! gugr _ 3”o") (2.337) 


i 
4 


gi = g — ia sf = 48 = Heigo". (2.338) 


The transformations of the bilinear forms under the discrete symmetries can easily be 
rewritten from those given in Section 2.10 and Table 2.2. Especially useful are the charge 
conjugation identities such as 


Wi Wen = Ul Var, wi swe, = -vipo War, (2.339) 
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analogous to (2.302) and (2.303). The Fierz identities for the two-component fields can be 
read off from (2.215) or derived directly from Problem 1.1. For example, 


(viart) (wl. o.War) = (viata) (virant) 


(2.340) 
(virot tin) (WlonYar) = -2 (irta) (Titr). 
Combining (2.339) and (2.340) one finds identities such as 
(virat) (Wiranta) =2(UshYar) (Witir): (2.341) 


As a very simple example, let us repeat the calculation of the amplitude for 


e` (pi)e*(p2) —> f(p3) f(pa) given in Section 2.9 one more time, using two-component 
notation. The QED interaction for the e7 is 


L= -eA JE = +eA, (wlLonw, Uho"' Vr) (2.342) 


and similarly for the other fermions, so that 


M (=) = “FF, 8)F ho" WalO)(OW ho Wales (1e® (2) 
AO i (2.343) 
M (-+, +=) = — E (FB) F (ITa W110) (0/U 0, trle Lee (2)) 


in an obvious notation. Using the free-field expression in (2.207), the amplitudes become 


M (+=, +) = ~iQ se? [p+ 8) 0" (-x-(4)) (—x-(2)) Fob (1)] 
M (—+, +—) = —iQ se? [b_(3)'o*(+x4(4)) (x-2) o up+(1)], 


which reproduces (2.245) and (2.246). 

We will mainly utilize the two-component notation in connection with neutrino masses 
and for displaying the interaction terms in a supersymmetric theory. Much more extensive 
expositions, including such topics as Feynman rules and propagators in two-component form 
are described in the books on supersymmetry and in (Dreiner et al., 2010). A more compact 
version of the two-component notation will be introduced in Chapter 10. 


(2.344) 


Independent Fermion Fields 


It was emphasized in (2.196) that the left- and right-chiral projections Yz, R = Pr rw or their 
associated Weyl fields U7, rz could be considered as independent degrees of freedom. However, 
from (2.287) and (2.301) it is clear that the conjugate fields Y° and YẸ p are not independent, 


but are related by Y° ~ t and YL, RY Yh z- In many cases it is convenient to work in terms 
of the Wr,r (or YzŁ,r) fields, as is conventionally done for QED, quantum chromodynamics, 
and most treatments of the standard electroweak model. However, it is sometimes easier to 
work in terms of the L-chiral fields for both the particles and antiparticles, i.e., Yz and WF 
(or Yz, VF) instead, regarding the Yr and Yh as dependent. This is typically done when 
discussing grand unification or supersymmetry, for example. To illustrate this, the QED 
electromagnetic current Jo can be written in a number of equivalent ways, including 


J = — bey He = Heey” Wee 


f i È 7 (2.345) 
= —WeLry Wet 7 WeRY PeR = Ype Y Wet + PecL Y” YecL, 
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where Ye and pec = WS are, respectively, the e~ and e* fields. In two-component notation 
Jb = WE a" Wer — Ul po Wer = -Vi 0 Ver, + Vipa Weer. (2.346) 
Similarly, a fermion mass term can be reexpressed as 
-L = mpy =m (dre + devs) =m (brede +Y Cour) 


2.347 
=m (WLR +ut, wr) =m (vivy = viot) , poan 
where we have used (2.301) and (2.292). The two forms in the second line emphasize two dif- 
ferent interpretations of a Dirac mass. The vi W_, expression, for example, can be viewed as 
annihilating an D -chiral field and creating an R-chiral one, while the equivalent UF io? Y L 
form can be interpreted as the annihilation of two distinct L-chiral fields. 


2.12 QUANTUM ELECTRODYNAMICS (QED) 


Quantum electrodynamics represents the merger of three great ideas of modern physics: 
classical electrodynamics as synthesized by Maxwell, quantum mechanics, and special rela- 
tivity. The basic formulation of QED was completed by 1930; it combined the Dirac theory 
of the electron (with its correct predictions of the lowest order electron magnetic dipole 
moment and the existence of positrons) with the quantization of the electromagnetic field 
into individual photons. A workable prescription for handling the divergent integrals by 
renormalization of the electron mass, charge, and wave function was completed by the early 
1950s by Bethe, Feynman, Tomonaga, Schwinger, Dyson and others (Schwinger, 1958). 
QED therefore became a mathematically consistent and well-defined theory, which was 
subsequently tested to incredible precision. 

In order to consider QED seriously, one must include higher-order (loop) effects in per- 
turbation theory, both because of the precision of many of the tests and because some effects 
(such as light by light scattering via the interaction of photons with virtual charged particle 
loops) only occur at loop level. The calculation of higher-order effects is greatly compli- 
cated by divergences and the need to renormalize (while maintaining gauge invariance). A 
systematic study is beyond the scope of this book, and only brief comments are made. The 
subject is studied in detail in standard texts on field theory. Another complication is that 
strong interaction effects are very important and cannot be ignored in studying the electro- 
magnetic interactions of strongly interacting particles (hadrons), such as protons, neutrons, 
and pions. They even enter at higher orders (through loops involving virtual hadrons) in 
the electrodynamics of electrons and muons. Fortunately, a great deal can be said about 
these strong interaction corrections using symmetry principles. 


2.12.1 Higher-Order Effects 


Analogous to the Hermitian scalar case in Section 2.3.6, QED has logarithmically divergent 
diagrams such as those shown in Figure 2.17. To take this into account, let us modify the 
notation in the QED Lagrangian density in (2.218) to 


ee 1 : 
L = y(x) (i Ø + e04 — mo) w(x) — gE F" 
7 1 (2.348) 
= W(x) (i Ø = mo) w(x) = eoJ A, = geme", 
where Jô = —wy"'w is the electromagnetic current operator defined in (2.294) and 7 is 


the electron field. eg and mo are the bare positron charge and mass, respectively, i.e., the 
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Figure 2.17 Electron-photon vertex (top left), and one-loop corrections, correspond- 
ing to the vertex correction (top middle), the vacuum polarization (or photon self- 
energy) correction (top right), and electron self-energy corrections (bottom). 


parameters that appear in £. We redefine e and m as the physical parameters, i.e., the 
ones actually measured. The renormalizability of QED implies that the divergences enter 
only in the relations between the bare and physical parameters and in the unobservable 
wave function renormalization of the fields,?° and that observable quantities are finite to 
all orders in perturbation theory when expressed in terms of e and m. 

The one and higher-loop vertex corrections imply that the lowest order electron-photon 
vertex iegy" is replaced by a function ieo I” (p2, p1)/Z, that can depend on the external 
momenta. Z; is the (divergent) vertex renormalization constant, defined by the requirement 
that T = 7“ at the on-shell point p3 = p? = m? and q? = (p2 — p1)? = 0. Similarly, the 
electron self-energy and vacuum polarization diagrams modify the electron and photon 
propagators so that on shell they take the same form as the free ones except they are 
multiplied by divergent wave function renormalization factors Z2,3, 


Zə 


> 23 
p—m+ie’ 


Sr(p) igh” Dr(q) > —igh” ee 


(2.349) 


where the position of the pole in S(p) defines the physical electron mass. It is convenient 
to associate a factor of Z /2 or Z; /2 from each electron or photon line with the vertex. The 
other Z!/? is associated with the vertex or external state at the other end of the line. The 
overall vertex factor therefore becomes 


1/2 


ZZ 
ieog" > io =Ma) (2.350) 
1 


26Īn addition to the ultraviolet divergences (associated with large momenta in the integrals) discussed 
here, there are also infrared divergences associated with low momentum virtual photons. These can be 
regulated by introducing a fictitious photon mass; they cancel against similar terms associated with the 
emission of soft real photons at energies below the threshold of the detector. 
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The renormalization factors Z1,2,3 can be combined with eg to define the physical charge e. 
The Ward-Takahashi identity ensures that Zı = Z2 to all orders, so we can identify 


e=e = ey Z4!”. (2.351) 


It can then be shown that I“(p2,pi) is finite to all orders when expressed in terms of 
e and m, and in particular, [“(po,pi1) = y + O(a) where a = e?/4r ~ 1/137 is the 
fine structure constant. The charge renormalization depends only on the photon vacuum 
polarization diagrams and is therefore the same for all particles, e.g., for electrons with 
charge —eg —> —e, and for u-quarks with charge +2e9/3 — 2e/3. Similarly, after removing 
the Z2,3 factors, the electron and photon propagators behave like the free-field ones near 
the physical p? = m? or q? = 0 poles, but can have momentum dependent corrections away 
from the physical masses. These can also be shown to be finite to all orders when expressed 
in terms of e and m. The vacuum polarization corrections will be discussed in Section 2.12.2 
in connection with running couplings. 

The relation of such renormalized quantities to physical on-shell amplitudes and matrix 
elements is considered in field theory texts, but it should be plausible from the above 
discussion that the on-shell matrix element of -J8 (2) between physical electron states is 


(Do 82|b(x)y"eh(x)|pi $1) = UI" (p2, p1 Jet? , (2.352) 


where q = p2 — pı and the x dependence follows from translation invariance, Equation (1.15) 
on page 3. The form of IT” (p2, pı) is strongly restricted by symmetry considerations, which 
continue to hold in the presence of the strong interactions. In particular, Lorentz invariance 
implies that the r.h.s of (2.352) must be a four-vector, which can only be constructed from 
pi, ph, and the Dirac matrices. The Gordon identities in Problem 2.10 indicate that it is 
sufficient to consider p{', in the combination q” only. Furthermore, QED (and the strong 
interactions) are reflection invariant. Therefore, using (2.276) the most general allowed 
form is 


_ _ iot” 

tT” (p2, pi)ur = üz |4 Fı (q?) + am We) +q“ F (°) | un, (2.353) 
where the form factors F 1,2,3(¢7) are Lorentz invariant functions of q?. Charge conservation, 
WIG = 0, which can be derived to all orders from the equations of motion or from the 
Noether theorem of Section 3.2.2, requires that q?F3(q7) = 0. One does not expect a 6 
function to develop to any order, so F3(q?) must vanish. Also, the Hermiticity of Jo implies 


T" (p2, p1) = I" (p1, p2), (2.354) 


where T” is the Dirac adjoint. From Table 2.2, F 2 must therefore be real. Finally, we have 
normalized so that F\(0) = 1. 

We have seen that eF; (0) is just the physical electric charge. To interpret F2(0), consider 
the interaction of an electron with a static classical gauge potential A,,(x) (cf., (2.66) and 
Problem 2.24). The matrix element of the interaction Hamiltonian 


Peek J Prone A (2.355) 
between one-electron states is 


(Do sol H|Pi 51) = —eŭI “uy J A, (Z je T "PZ = —etigI“u Å (g). (2.356) 
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Expanding the fermion bilinear in (2.356) to linear order in the non-relativistic limit 
\Di| < m, this reduces (Problem 2.30) to 


ot, (—2meAo(@) + eli +P): AZ) + e l1 + OF BM) bs (2.357) 


where ¢s,. are the two-component Pauli spinors and B is the Fourier transform of the 


magnetic field B =V x A. The conventional non-relativistic interaction Hamiltonian for an 
e in an external field is 


H; = —eA(ž) + = (z ĀE) + A(z) P) — ji. - B(Z) + Ole), (2.358) 


where je = -gug is the electron magnetic dipole moment operator, ug = e/2m is the 
Bohr magneton, S=6 /2 is the spin operator, and g is the electron g-factor, which is not 
predicted in the non-relativistic theory. The three terms correspond to the Coulomb interac- 
tion, and the orbital and spin magnetic moment interactions, respectively. The momentum 
space matrix element of H; (corrected to agree with our covariant state normalization) is 


BaBa m J S: A (2.359) 


with the p operators in H; replaced by the po eigenvalues. This coincides with (2.357) 
provided that one identifies the g-factor as g = 2 [1 + F2(0)], where the 2 is the relativistic 
Dirac contribution (from 7“F\(0)), and F2(0) = (g — 2)/2 is the anomalous QED term, 
usually denoted by ae. The leading contribution, from the vertex diagram in Figure 2.17, is 
F,(0) = a/2z, as first calculated by Schwinger. The calculation is sketched in Appendix E as 
an illustration of the techniques for calculating Feynman loop integrals. The relation of field 
theory matrix elements to non-relativistic potentials is further discussed in Problem 2.33. 


2.12.2 The Running Coupling 


We saw in (2.351) that the renormalization of the electric charge is due to the vacuum 
polarization diagram of Figure 2.17, with the divergent parts of the electron self-energy and 
vertex diagrams cancelling by the Ward-Takahashi identity. Now, let us consider the vacuum 
polarization as a function of q?. The one-loop vacuum polarization diagram in Figure 2.18, 
can be added to the tree level photon propagator to yield (see, e.g., Peskin and Schroeder, 
1995) 


7 =h 2 N2 
~ d a eg i a4 am) (2.360) 
where we have also included a factor of e2 from the outside vertices. A is an ultraviolet cutoff 
of the divergent momentum integral. In practice, the cutoff must be introduced in a way 
that respects gauge invariance, such as in the Pauli-Villars or dimensional regularization 
schemes. II(q?) is a finite function that vanishes at q? = 0, so we can identify the photon 
wave function renormalization factor as 


(2.361) 


2 2 
Z= |1 ĉo] S 


~ r m? 
which diverges for A > oo. The r.h.s. of (2.360) can be rewritten 
2 


—iguv 2 €o 2 —1g uv e 
1+ L Aj 2.362 
e“ | are j @ lel) AR 
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where the corrections to the last form are higher order in e or e*. One can show that this 
form holds and is finite to all orders, provided T(q?) is expressed in terms of e? and m?. 
The e?™”II” term in the expansion (1 — e?IT)~! = 1 + e?II + eII? +--- corresponds to the 
diagram with n separate vacuum polarization bubbles along the line. 
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Figure 2.18 One photon exchange and one-loop vacuum polarization “bubble.” 


This illustrates how the divergences disappear from the expressions for physical observ- 
ables when they are written in terms of the renormalized quantities. However, one may still 
have a nagging doubt about the underlying divergences. In fact, the modern view is that 
the “divergent” momentum integrals are actually cut off physically at the scale at which the 
theory is replaced by a more complete one, and that A should be associated with that scale 
and not taken to infinity. One can then interpret the renormalizations as finite quantities 
describing, e.g., the difference between a coupling as measured at a scale much smaller than 
A and the value it would have at A. A logarithmically “divergent” term in a weak coupling 
theory such as QED is then actually a small effect, e.g., Ea In na ~ 0.08 for eĝ ~ e? and 
A ~ Mp ~ 101° GeV (the Planck scale). Most of the divergences in renormalizable theories 
are of this logarithmic nature,?” and results are therefore insensitive to the details of the 
new physics above A. Non-renormalizable theories, on the other hand, typically encounter 
new divergences of order A?” at n-loop level, and are very sensitive. 

The function II(q?) in (2.362) is 


m? — gz(1—2z) 
2 7 


II(q”) = a f dz z(1 in| (2.363) 


m 
which vanishes as I(q?) —> —q?/607?m? for q? — 0. It is well behaved for q? < 0 (i.e., 
t-channel exchange, as in Figure 2.18), but has a branch point at q? = 4m? associated 
with the ete~ threshold in s-channel processes. In the limit Q? = —q? >> m? (positive for 
t-channel exchange), 

Q? 


II(q”) > pa In ae (2.364) 


The replacement in (2.362) is universal for all photon exchanges. It is useful to introduce a 
running or effective fine structure constant arr (Q?) ~ e?(Q?)/4m = a/(1— 4rall(—Q?)). 
Such considerations are most useful for large Q?, so we will actually define the running 
quantity as 


2 a( 3) 
a (Q?) E (2.365) 
3r 2 


27 The quadratically divergent (but renormalizable) corrections to scalar self-energies, such as the Higgs 
mass-square in the standard model, are a critical exception. This will be discussed in Chapter 10. 
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where Q3 is an arbitrary reference scale, such as m?. This coincides with a, rp for QR > 
Q? ~ m?, and approaches the bare coupling ag = ef /4r for Q? — A? (up to higher-order 
corrections). Equation (2.365) is equivalent to 


1 1 Q? 
a(Q?) al(Q?) 3 


where b! = 4rb = 1/37. Thus, a(Q?)~! runs linearly with In Q? at large Q?. (Higher-order 
corrections to the vacuum polarization bubble lead to small nonlinear effects). The running 
a(Q?) therefore increases logarithmically from its value ~ 1/137 for small Q?, so the QED 
interaction strength should be larger for high energy processes. This was motivated here 
for spacelike momentum transfers q? < 0, but continues to hold even in the timelike region, 
where higher-order corrections are minimized if one uses a(|q?|). Equivalently, the effective 
Coulomb interaction scales as a(Q ~ 1/r)/r, and therefore the effective œ increases at 
smaller separations. There is a simple interpretation of this: an electron charge in a dielectric 
medium is screened for larger separations, leading to an interaction strength falling faster 
than 1/r. At small separations, however, the screening is less effective and a test charge feels 
the full electric charge. The same mechanism applies here, except the dielectric is really due 
to the quantum fluctuations of the vacuum, as represented by the virtual ete~ loop. We 
will see in Chapter 5 that the gluon self-interactions in quantum chromodynamics have the 
opposite effect of antiscreening (for which there is no simple classical analog), leading to a 
decrease in the strong coupling at large momenta or short distance (asymptotic freedom). 
The running of a(Q?) is described by the renormalization group equation (RGE) 


da(Q*) 
dln Q? 


(2.366) 


= B(q’) = Va? (Q?) + O(a"), (2.367) 


where b/ = 1/37 is due to the one-loop diagram in Figure 2.18, and the other terms are 
higher-loop contributions to the vacuum polarization bubbles. Equation (2.366) is the solu- 
tion in one-loop approximation. The running in (2.366) is valid for Q? >> m?; there is little 
effect for Q? < m?. However, for Q? >> m3, where m, ~ 200 me is the muon mass and 
Me = m is the e7 mass, one should also include the effect of the muon loop in the photon 
propagator, i.e., b + 2/37, and 


1 2 2. Q? 
AO) ae) ar Te (2.368) 


for Q? > ma. Thus, b changes discontinuously and 1/a has a kink near the particle thresh- 
old. (The exact form of the threshold, including constant terms, whether it occurs at m, or 
2m, etc, depends on the details of the renormalization scheme.) Quark loops also contribute 
to the vacuum polarization and the running. If one could ignore the strong interactions, then 
a quark of flavor r (e.g., u, d, s) would contribute a term 3q?/37 to b’ for Q? > m2, where 
qr is the quark electric charge in units of e, m, is its mass, and the 3 is because there are 3 
quark colors. Thus, 


b= > X Orè, (2.369) 

mr<Q 
where the sum includes both quarks and charged leptons, with C, = 1 (leptons) and C, = 3 
(quarks). Unfortunately, this is a poor approximation for the strongly interacting particles, 
which cannot really be treated as free quarks at low energies. Multiple gluon exchanges 
between the quarks in the vacuum polarization diagram, or hadronic bound state effects, 


Review of Perturbative Field Theory 69 


invalidate the quark part of (2.369). A more reliable approximation can be obtained by 
replacing the perturbative loop by a dispersion relation 


3, Q? °° 


h T? Jire F(Q?, s)R(s)ds (2.370) 


in the 1/a equation. R(s) = o,+--(s)/(4ma?/3s) is the ratio of the total cross section for 
ete — hadrons at CM energy ys divided by the cross section for ete~ —> pty and 
F(Q?,s) is a known function (see, e.g., Eidelman and Jegerlehner, 1995). The low-energy 
part of R(s) can be taken from experiment, while the high-energy part is predicted by QCD. 
One therefore finds that 1/a decreases from ~ 137 for Q ~ 0 to ~ 129 at the the Z mass. 
The extrapolation can be done quite reliably, but the small (O(0.02%)) uncertainty is still 
the largest theoretical uncertainty in the precision electroweak program. Closely related 
hadronic uncertainties are also significant in the interpretation of the measured anomalous 
magnetic moment of the muon. 

The running a(Q?) effect was sketched here in an on-shell renormalization scheme, i.e., 
e was defined in terms of the electron-photon vertex on shell, and m as the location of the 
pole in the propagator. In the minimal subtraction schemes (’t Hooft, 1973; Bardeen et al., 
1978) one defines renormalized couplings and masses at an arbitrary renormalization scale 
u, at which the poles in dimensional regularization are subtracted. Both the masses and 
charges run as a function of u. Higher-order corrections to physical processes involving a 
single large scale Q are usually minimized by evaluating the couplings and masses at u = Q. 


2.12.3 Tests of QED 


Quantum electrodynamics is the most successful theory in physics when judged in terms 
of the theoretical and experimental precision of its tests. A detailed review is given in (Ki- 
noshita, 1990). The classical atomic tests of QED, such as the Lamb shift, atomic hyperfine 
splittings, muonium (jte~ bound states), and positronium (ete~ bound states) are re- 
viewed in (Karshenboim, 2005). More recent results and the experimental values of a and 
other physical constants are surveyed in (Mohr et al., 2012). Measurements of a and of 
possible deviations from QED are listed in Table 2.3. 


TABLE 2.3 Most precise determinations of the fine structure constant a = e° /4r 
and other QED quantities. 


Experiment Quantity Value Precision 
ae = (ge — 2)/2 aT! 137.035 999 157(33) 2.5 x 10710 
h/m(Rb) av} 137.035 999 049(90) 6.6 x 10710 
Solar wind my < 10718 eV — 

CMB Qy < 10735 — 

e hvy Te- > 6.6 x 1078 yr — 


Neutrality of SFs |Qp + Qe-l; Qn < 107? — 


"Q; is the electric charge of particle i in units of e. Detailed descriptions, caveats, and references are 
given in (Mohr et al., 2012; Aoyama et al., 2015; Patrignani, 2016). 
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Figure 2.19 One-loop and typical two-loop diagrams contributing to the anomalous 
magnetic moment of the electron. 


The most precise measurement of œ is from ae = 1.159 652 180 73(28) x 1073, the 
anomalous magnetic moment of the electron, which was measured at Harvard University 
using electrons confined in a Penning trap (Hanneke et al., 2008). œ is extracted using the 
theoretical (standard model) prediction (Aoyama et al., 2015) 


2 3 
aSM = Č — 0.328 478 444 00 (2) + 1.181 234 017 (Ž) 
20 T T 


a4 a\5 (2.371) 
— 1.912 05(84) (5) +8.73(34) (*) p aEW 4 ahot, 


where the first five terms are the pure-QED contributions, involving photons, electrons, 
muons, and taus. A few representative diagrams are shown in Figure 2.19. The one-loop 
term is calculated in Appendix E. The two- and three-loop contributions have been calcu- 
lated fully analytically, while the four-loop (891 diagrams) and five-loop (12,672 diagrams) 
terms require numerical integration of the Feynman parametric integrals, resulting in the 
quoted error in the coefficient.” The electroweak and hadronic contributions involving W, Z, 
Higgs, and strongly interacting particles are estimated to be ~ 0.02973(52) x 1071? and 
~ 1.706(20) x 1071?, respectively (for reviews of the theory, see Czarnecki and Marciano, 
2001; Mohr et al., 2012; Aoyama et al., 2015). 

ae yields the single most accurate determination of a, but to test QED it is necessary to 
compare the values obtained in two or more types of experiment. The second most precise 
involves the measurement of the recoil velocity of ®”Rb atoms after emitting or absorbing 
a photon (Bouchendira et al., 2011). This yields h/m(Rb), which can be combined with 
measured mass ratios and the Rydberg constant to determine a. It can be seen in Table 2.3 
that the values of a~! obtained by the two methods agree within 1.10. Equivalently, one 
can use the 8’ Rb value as input to predict ae, yielding 


aP — gSM = _0.91(0.82) x 1077”. (2.372) 


A number of other, somewhat less precise, values of a~! obtained from measurements of 


ae at the University of Washington using electrons and positrons (which are the same by 
CPT), the quantum Hall effect, the p and *He gyromagnetic ratios using the Josephson 
effect, h/m(C's), and other quantities are also in agreement (Mohr et al., 2012). 

The impressive agreement in (2.372), which involves a heroic calculation of ae to tenth 
order, validates not only QED but the entire formalism of gauge invariance and renormal- 
ization theory. However, one caveat is in order. In interpreting very precise measurements, 


28 The extracted value of a has changed significantly from the first edition of this book due to the correction 
of an error in the calculation of the coefficient of (a/m)* in (2.371). 
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especially in extracting parameters from them, one must always be concerned that possible 
effects from (unknown) physics beyond the standard model might lead to an error. This 
possibility can never be totally eliminated. However, in the case of ae the effects of physics 
at scale M are usually of O[(m./M)?], which is < 1071? for M ~ 1 TeV. Including real- 
istic couplings this is usually very small compared to (2.372). (For further discussion and 
possible loopholes, see Giudice et al., 2012). 

Other basic predictions of gauge invariance (assuming it is not spontaneously broken — 
see Chapter 4), are that the photon mass mą and its charge Q, (in units of e) should vanish. 
The current upper bounds, based on astrophysical effects (the survival of the solar magnetic 
field and the isotropy of the cosmic microwave radiation (CMB)), are listed in Table 2.3. If 
QED were spontaneously broken one would expect electric charge nonconservation, which 
would allow the electron to decay, e.g., into vy (Problem 2.31). The experimental limit 
(from the Borexino experiment (Agostini et al., 2015a)) is Te > 6.6 x 1075 yr. 

The charge assignments of the fundamental fermions are technically arbitrary in QED, 
but it is usually assumed that atoms are electrically neutral, so that Qp = —Qe and Qn = 0. 
(See the discussion in Section 10.1.) Experiments on bulk matter (Bressi et al., 2011) indicate 
that this holds to high precision (assuming Qn = Qp + Qe). 

QED has also been tested at high energies, especially at the ete~ colliders PEP (at 
SLAC), PETRA (DESY), and TRISTAN (KEK), which operated below the Z pole where 
the s-channel photon diagram dominates (Wu, 1984; Kiesling, 1988). While not as precise 
as the low energy tests, these results all confirmed the QED predictions. QED was also 
tested indirectly in the Z pole experiments at LEP (CERN) and SLC (SLAC), and above 
the Z pole at LEP 2, where it was an important ingredient for calibrations and entered 
interferences and radiative corrections. Finally, the running of a with Q? has been confirmed 
experimentally (for a review, see Mele, 2006). 

Despite all of the successes of QED, there are two discrepancies, possibly due to new 
physics beyond the standard model: the magnetic moment of the muon and the proton 
charge radius. 


The Anomalous Magnetic Moment of the Muon 


The anomalous magnetic moment of the muon a,, has been measured to high precision in the 
Brookhaven 821 experiment (Bennett et al., 2006), in which the precession of the u magnetic 
moment relative to its momentum in a storage ring was monitored using the parity-violating 
correlation between the momentum of the decay e* and the y* spin direction (Section 7.2). 
The value obtained was a’? = 116 592 091(54)(33) x 10711, where the two errors are, 
respectively, statistical and systematic. a, is especially important because it is expected to 
be more sensitive to new physics than most of the other probes, with deviations typically 
of O[(m,/M)?], ie, O[(mz/me)*] larger than those for ae. The SM expectation is [for 
reviews, see (Miller et al., 2007; Jegerlehner and Nyffeler, 2009; Mohr et al., 2012) and the 
Höcker and Marciano article in (Patrignani, 2016)] a?“ = af"? + aPW + ale. The pure 
QED part, from diagrams analogous to Figure 2.19, has been calculated to five loops (three 
analytically) (Aoyama et al., 2012), 


QED _ Q = 2 : 
aQPP = + 0.765 857 425(17) (2) + 24.050 509 96(32) (2) 


+ 130.879 6(6 3) 6j + 753.3(1.0) el (2.373) 


=116 584 718.95(0.08) x 107+, 
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Figure 2.20 One-loop electroweak contributions to a, = (g, — 2)/2. 


where the uncertainties in the coefficients are from the lepton mass ratios and the numerical 
integrations. The total in (2.373) uses a from h/m(Rb) in Table 2.3 to avoid theoretical 
correlations between ae and a,,. However, the value obtained from a(ae) differs only by 
O(10~!*), which is negligible compared to the other uncertainties. 

There are additional electroweak and hadronic contributions to a, (Patrignani, 2016). 
The electroweak diagrams include the contributions of the W, Z and Higgs (H) bosons, 
shown in Figure 2.20. The first two diagrams dominate, yielding 


2 
pw. CrM, 


y 8/272 


at one loop, where Gp and sin? @yw are, respectively, the Fermi constant and weak angle. 
Including the two-loop diagrams, which are enhanced by large logarithms, 


1 
a G + -(1 — 4sin? Ow )) ~ 194.8 x 107+ (2.374) 


3 3 


ae” = 153.6(1.0) x 107+. (2.375) 


An original goal of the BNL 821 experiment was to achieve sensitivity to ar However, 
there is a significant contribution and uncertainty from the hadronic contributions ae 


such as shown in Figure 2.21. In particular, the hadronic vacuum polarization diagram 


Y 24 
: had Il 
had vac 
H H H H 


Figure 2.21 Two-loop hadronic vacuum polarization (left) and hadronic light by light 
(right) diagrams. The shaded blobs represent hadronic states. 


yields a large contribution that cannot be reliably calculated in perturbation theory. It is 
closely related to the hadronic contributions to the running of a mentioned in Section 2.12.2. 
It can be estimated from the cross section for e*e~ — hadrons using a dispersion relation 
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similar to (2.370), but with a slightly different weighting function. The integral is dominated 
by the experimentally determined low energy region (such as the p resonance), yielding (e.g., 
Davier et al., 2011) a% ~ 6 923(42) x 107". Although this is quite precise, the uncertainty 
is comparable to the experimental uncertainty in a,. The situation is confused by the fact 
that one can also obtain ahad from hadronic 7 decays, which are related by isospin. Applying 
isospin breaking corrections leads to a value 7 015(47) x 1071, which differs by ~ 1.8ø. 
There are also small but nonnegligible hadronic light by light diagrams (e.g., Prades et al., 
2009) and three-loop vacuum polarization diagrams. The former cannot be directly related 
to experimental data and relies on model calculations. The sum is around 7(26) x 107+. 


Using the ete~ value for a’@¢, one finds the standard model expectation? 
8 m 


a“ = 116 591 803(49) x 10°" = Aa, = a6? — aĵ™ = 288(80) x 107", (2.376) 


a 3.60 discrepancy. Using the 7 decay value of a*® reduces the discrepancy to 2.40. 
SM; 


If one accepts the value of a7” in (2.376) then there is a strong suggestion of new physics 
contributions to a,. For example, in the supersymmetric extension of the standard model 


the diagrams in Figure 2.22 would give an additional contribution that would be significant 


had 
m 


Figure 2.22 New contributions to a, in the supersymmetric extension of the SM. ñ 
tO 


and Č are the spin-0 superpartners of the u and v, while ¥*”" are spin-4 superpart- 


ners of the electroweak gauge bosons and Higgs fields. 


for relatively low masses for the supersymmetric partners and/or relatively large tan 3 (the 
ratio of the expectation values of the two Higgs doublets in the theory). The central value 
of the discrepancy in (2.376) would be accounted for (Czarnecki and Marciano, 2001) if 


msusy ~ 67y tan 8 GeV, (2.377) 


where msysy is the typical mass of the new particles in Figure 2.22. (The Higgsino mass 
parameter u, introduced in (10.148), must also be positive.) 

Another possibility is that a, could be enhanced by vertex corrections involving a dark 
photon (e.g., Pospelov, 2009), a hypothetical light (e.g., 10-100 MeV) gauge boson with 
very weak couplings to ordinary matter. These have been suggested in connection with 
dark matter and can be searched for in many decay and low-energy processes (Essig et al., 
2013; Alexander et al., 2016). 

New experiments at Fermilab and J-PARC (e.g., Gorringe and Hertzog, 2015) are ex- 
pected to reduce the uncertainty in af"? by at least a factor of 2. 


29 All of these estimates follow (Patrignani, 2016). However, other analyses (e.g., Hagiwara et al., 2011) 
yield similar results. The hadronic uncertainties are reviewed in (Benayoun et al., 2014). 
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The Proton Charge Radius 


The proton charge radius R, (defined precisely in (2.399)) can be measured in hydro- 
gen/deuterium spectroscopy, which also determines the Rydberg, and independently in ep 
scattering, yielding the combined average 0.8775(51) fm (Mohr et al., 2012). Recently, a very 
accurate measurement of the Lamb shift in muonic hydrogen (u~ p) has been performed at 
the PSI in Switzerland (Pohl et al., 2010). Muonic atoms are very sensitive to R, because 
of the large u mass and consequent small extent of its wave function. Surprisingly, Rp was 
determined to be 0.84087(30) fm, some 7c lower than the e~ value. So far, no one has found 
a plausible theoretical or experimental explanation for this discrepancy in the context of 
QED or the standard model. The muonic Lamb shift and a, suggest the possibility of some 
new physics that especially affects the u, such as the exchange of a dark photon or other 
light particle with weak couplings. However, the Rp anomaly is much larger than the a, one, 
and it is nontrivial (but not impossible) to account for both simultaneously while satisfying 
other constraints. All of these matters are reviewed in (Pohl et al., 2013; Carlson, 2015). 


2.12.4 The Role of the Strong Interactions 


Many QED processes involve strongly interacting particles. Strong interaction effects com- 
plicate the calculations, but in simple enough cases one can still say a great deal using 
symmetry principles. 


The Pion Electromagnetic Form Factor 


Let us start by revisiting the e~7* — e ~7* scattering amplitude in Figure 2.15 and 
Equation (2.222). From (2.133) and (2.218) the relevant Hamiltonian interaction density is 


Hy = -Lr = eh A, — PA’ od with JG = —py"y + idt Be, (2.378) 
where w and ¢ are, respectively, the e~ and r*+ fields. The amplitude is schematically 
Mji ~ (e rtIT Ii f Hada, -i | Ha(a!)aa! Jena) 
~ (~ie)? (e7 (ka, 82) Jb le7 (Æi, 81)) (OT (An, Av)|0) (a (P2) J6 lrt (51)) 


fe 7 —i v = v > 
~ alk, sa) s1) ( a ) HIJA PD), (2.379) 


where q = p2 — pı and the details of the Fourier transforms and momentum-conserving 6 
functions are not displayed. In the absence of strong interactions, one can replace J6 in the 


last expression by id! Fre where ¢ and ¢' are free fields, yielding 
(T+ (B2)|JG()|n* (P1)) = (p2 + p1)” ett? (2.380) 


In the presence of the strong interactions Jô is still defined by the expression in (2.378). 
However, the fields are no longer free, but rather are Heisenberg or interaction picture fields 
w.r.t. the strong interactions. For example, consider the interaction terms 


Len = iV 295 (Po Undes + Paota) —4A(dtidar) » 381) 


where Yp, Yn, and ¢,+ are, respectively, the proton, neutron, and zt fields. Lry corre- 
sponds to the interaction vertices shown in Figure 2.23, and represents a subset of those in 
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Figure 2.23 Top: interaction vertices described by (2.381). Middle: representative 
strong interaction diagrams contributing to (m*|J6|m*) involving nucleons, pions, 
and the p? resonance. Bottom left: diagrams in the quark model description in 
which the z+ is a ud bound state. Bottom right: parametrization of the strong 
interaction effects by a form factor, represented by a shaded circle. 


the isospin conserving model described in Section 3.2.3. (The m7 field is ¢,- = play while 
the 7° and such states as the p resonance can easily be added.) An alternative and more 
fundamental description is to regard the 7+ as a bound state of a u (charge +2/3) and d 
(charge +1/3) quark and antiquark. In practice, one cannot treat the strong interactions 
perturbatively using either description because the couplings gy and A, or the vertex relating 
the bound state 7+ to the quarks, are too large. However, one can parametrize our uncer- 
tainty by writing matrix elements such as (7*|Jg|7*) in a way that reflects the symmetries 
of the theory and that involves one or more form factors, analogous to those introduced for 
higher-order QED effects in Section 2.12.1, for the residual unknowns. These form factors 
can be measured experimentally, computed using non-perturbative lattice techniques, or 
estimated in other theoretical models. 

Consider (1* (p2)|J6(a)|7* (p1)). By an argument similar to the one in Section 2.12.1, 
the most general form is 


(n+ (p2)|JG(x)|7* (p1)) = [0 + pi)" fS (a) + (p2 — pi)" fE (| s7. (2.382) 


The x dependence follows from translation invariance, while the only available Lorentz four- 
vectors are (p2 + pı)”. The form factors oe can depend on q?, which is the only Lorentz 
invariant available since the pions are assumed to be on shell (they could also depend on 
Ps if we extended the discussion to off-shell pions). The electric charge, in units of e, is 
f2(0). In the absence of strong interactions we would have f?(0) = 1 and f?(q?) = 0. 
Since the strong interactions conserve electric charge, we expect that fe (0) = 1 to be 
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maintained, i.e., that the electric charge is not renormalized. If this were not the case, 
the «+ or proton electric charges would differ from those of the e+ due to the strong 
interactions. The nonrenormalization will be demonstrated in Section 7.2.4. Furthermore, 
current conservation,*” 0,,J6 = 0, requires f2(q@) =0, analogous to F; = 0 in (2.353). 

FEP) is known as the electromagnetic form factor of the pion. Although f£ (0) = 1, 
the strong interactions (and higher-order QED effects) are expected to induce a nontrivial 
q? dependence. For small q? one can expand 


1 
PRP) ~ 1+ GPa, (2.383) 
where Rr is known as the pion charge radius. The term is motivated by non-relativistic 
quantum mechanics. Suppose the 7+ were a bound state of a non-relativistic constituent 
with charge +e bound in a potential with wave function w(z) (this roughly but not exactly 
mimics the effects of ud constituents). Then the non-relativistic analog of FEP), which 


plays a similar role in pion scattering from a static field, is 
F (g) = J ËZ eT yl). (2.384) 


It is obvious in the non-relativistic case that F (0) = 1 by the normalization of the wave 
function, i.e., charge is not renormalized by the bound state effects. For small |q|?, and 
assuming a spherically symmetric wave function (which is reasonable since the pion spin is 
0), it is straightforward to show that 


F(@)~1- Tra with R? = féziePw@e. (2.385) 
so R, is the RMS charge radius. The non-relativistic approximation is not really valid, but 
the motivation for the terminology remains. 

ie) can be measured in the spacelike region q? < 0 by scattering pions from an 
atomic target, or in the reaction e~p — e~ ntn, in which the e~ scatters from a virtual m+ 
emitted by the proton, working in a kinematic region where the exchanged pion is as close 
as possible to being on shell. I can be approximated for small |q?| by 


1 
RÒ ~ r 


6 


(2.386) 


with \/R2 = 0.672 + 0.008 fm (Patrignani, 2016). It can be measured in the timelike region 
for q? > 4m? by ete” — ntr”. For large q? it falls off rapidly as 1/q?, while for lower q? 
it is dominated by resonances, especially the p°, which is a spin-1 resonance with m, ~ 770 
MeV and a width I, ~ 150 MeV. Near the p the form factor assumes a classic Breit-Wigner 
resonance form (Appendix F), which to first approximation is 


m4 


UP? w p . 2.387 
lft (@)| (ma)? 4 ner? (2.387) 


Corrections to this formula are given in (Gounaris and Sakurai, 1968) and plots in (Am- 
brosino et al., 2011). 


1> 
30 As mentioned in Section 2.6, the conserved current is actually iġt 8# — 2eġt AX. This is irrelevant 
here as we are working to lowest order in e. 
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The Proton and Neutron Form Factors 


Now consider the elastic scattering process e7 p + ep. Similar to (2.378) and (2.379), the 
electromagnetic interaction is 


Lr= -eJQA, with JQ = pey" pe + Yp" tp, (2.388) 


where pe and Yp represent the electron and proton fields, respectively, and the lowest order 
amplitude can be written 


Mp = ie ū(ka)y"ulkı) (=) (—ie)(p(P2)|JQ|p(P1)), (2.389) 


where q = p2 — pı and the spin labels are suppressed. Let us first treat the proton as a 
pointlike elementary particle. Then 


(p(P2)|Jo|p(P1)) = U(P2)7" uP). (2.390) 
The cross section in the lab frame, i.e., the proton rest frame, is 


0L Og Oy 
2? : 2.391 
cos“ = ame sin’ > |) ( ) 


dr 4k? sin* %& k 


da a? ko 
1 


where we have neglected the electron mass, the proton mass is mp, Oz is the angle of the 
scattered electron, dQ; = dydcos@, — 2ndcos@z, the final electron energy is given by 
(2.46), 

k2 1 1 


ky 1+ (l= coer) E 1+ TR sin? f) 


(2.392) 


and q? = —4kı kə sin? om 

When we turn on the strong interactions, we must include the full matrix element of JG 
in (2.389). This is denoted by the shaded circle in Figure 2.24, which also contains some 
representative strong interaction corrections to the vertex, both in the approximation of 
treating the virtual states as physical hadrons, or in the quark model, for which the proton 
is a uud bound state. Just as for the pion, these effects cannot be summed perturbatively. 
They must be expressed in terms of form factors that parametrize the matrix element in a 
way that takes into account the symmetries of the theory, and that can be determined by 
experiment, by model calculations, or by lattice or other techniques. After making use of 
the Gordon identities in Problem 2.10 the most general Lorentz and translation invariant 
matrix element is 


(p(B2)| JG (a) |p(1)) = U2) TG (aup) et, (2.393) 
where 
pe wre 2), OY pr oy uppi 
Qla) = [FE ) + 5a Fa) + "FS (7) 
P 
jovas (2.394) 


+ yey? gP(q?) + ahl) + a7°98 (4°) | , 


2Mp 
where the form factors F” and g? depend on q?. We have so far not imposed additional 
symmetries such as reflection invariance, so the discussion would be valid even when weak 
and higher-order electromagnetic effects are included. Similar to the discussion in Section 
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Figure 2.24 ‘Top: tree-level amplitude for e~p — e` p for a point proton and with 
proton form factors (shaded circle). Bottom: representative strong interaction cor- 
rections in terms of virtual hadrons (three left diagrams) or in terms of quark 
constituents (right). 


2.12.1, the proton electric charge is eF'(0), so the nonrenormalization of charge (Section 
7.2.4) implies F?(0) = 1. xp = F$} (0) is the anomalous magnetic moment of the proton, i.e., 


lip = +9pHN Sp, Ip = 2(1 aF Kp), (2.395) 


where uy = e/2my is the nuclear magneton. The experimental value gp ~ 1.79 is very much 
larger than the electromagnetic one-loop contribution of a/27, indicating the importance 
and nonperturbative nature of the strong interaction corrections. The value, and also the 
ratio of the proton and neutron magnetic moments, can be approximately understood in 
the non-relativistic quark model (see Problem 3.22). Electromagnetic current conservation 
further implies q,,u(2)1'@(q)u(P1) = 0, and therefore 


FP(q?)=0, — 2mpg?(q?) + Pl) = 0. (2.396) 


From the discussion in Section 2.10 and Appendix G, g?(q”) 4 0 requires the viola- 
tion of space reflection invariance since Jô transforms as a vector. I 3 # 0 would require 
the violation of charge conjugation invariance as well. Since the weak interactions violate 
P and C they are expected to generate a nonzero 3 subject to (2.396), known as an 
anapole moment, at some level, e.g., due to W or Z exchange across the photon vertex 
or a parity-violating scalar correction to the pion-nucleon coupling. The anapole moment 
has been observed in parity violating transitions in the 133C's atom (Haxton and Wieman, 
2001; Haxton et al., 2002) at the expected order of magnitude, though the calculations 
are difficult and model comparisons with other forms of hadronic parity violation are not 
in perfect agreement. A nonzero value for g5 4 0 would require both space reflection and 
time reversal violation, and would correspond to an electric dipole moment (EDM) for the 
proton (see Problem 2.30). P is violated by the weak interactions, but T violation is much 
weaker, especially for the light fermions, so EDMs are predicted to be extremely small in 
the standard model, as discussed in Section 8.6. However, many types of physics beyond 
the standard model predict EDMs much larger than in the SM, so experimental searches 
for the neutron, electron, atomic, or other EDMs are extremely important. 
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Let us now restrict our attention to strong interaction corrections to TŻ, so that 
H Le EP (2 iof” PI 2 
Tola) = Fi) + om er? (a)l (2.397) 
p 


with FP (0) = 1 and F} (0) = kp, analogous to the higher-order QED corrections to the e7 
vertex in (2.353). For q? < 0 it is conventional to define Q? = —q? > 0, and to define the 
Sachs, or electric and magnetic,®! form factors 


Gela) = FR?) - TF’), Gul?) = FP (0°) + FR (0), (2.398) 


where 7 = Q?/4m* = —q° /4m2, so that Gz(0) = 1 and Gy(0) = 1+«p. The proton charge 
radius R, is defined by 


(2.399) 


analogous to the pion charge radius in (2.383). 
One can then generalize (2.391) to the Rosenbluth cross section formula for elastic e~ p > 
ep scattering 


dQ, 4k? sin4 Su ky 


Or 2 0L 
<= cos? 5 + 27Giy sin’ > (2.400) 


da a? k2 [2 +G; 
The neutron also has electromagnetic properties due to strong interaction and bound 
state effects, just as a neutral atom has electromagnetic moments due to its internal charge 
distribution. This is most obvious in the quark picture, where it is a udd bound state, but 
can also be seen in the virtual hadron language, e.g., induced by the diagrams analogous 
to Figure 2.24 in which the neutron turns into a virtual pr~. The neutron electromagnetic 
matrix element is 
=> H > -> ME nT iot” Traa > 
(BIE En) = ae) |" FR?) + a FR (@)] u(r), (2.401) 
n 

where the neutron total charge is FP (0) = 0 and F3 (0) = kn = gn/2 ~ —1.91 is the neutron 
anomalous magnetic moment. G'g m and the charge radius Rn are defined as in (2.398) and 

(2.399). It is sometimes useful to define the isovector and isoscalar form factors 


1 
FY, = = (FP, — Fo), F’, = (FR. + Fio), (2.402) 


1 
2 
and similarly for Gaai 

One can extract the form factors Gh m(Q°) or FPa(Q°) by measuring the cross section 
for e~p — ep while varying kı and 0z for fixed Q?, and also by scattering from polarized 
protons (for reviews, see Hyde-Wright and de Jager, 2004; Arrington et al., 2007; Pacetti 
et al., 2014; Punjabi et al., 2015). They are described by an approximate dipole form 

Gh (Q? 1 
Gp(Q’) ~ uQ) 7 (2.403) 
ltp  [1+Q?/Q}] 


where Q2 ~ 0.71 GeV? ~ 18.2 fm~?, which holds to ~10% for Q? < 10 GeV”. The neutron 


31The terms are motivated by the form of the matrix element in the Breit frame (Problem 2.32), which 
generalizes the discussion of the non-relativistic limit of the e~ vertex at the end of Section 2.12.1. 
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form factors are obtained from scattering from nuclear targets, such as deuterium (D) or 
polarized *He, and are approximated by G¥,(Q?)/Kn ~ G%(Q?) for Q? < few GeV’, with 
|G% (Q?)| much smaller (a few %). 

The approximate 1/Q? and 1/Q* behaviors of the pion and proton form factors for 
large 1/Q? are consistent with the expectation (see, e.g., Lepage and Brodsky, 1980) of 
(Q?)--) for a bound state of n constituent quarks based on dimension counting rules 
derivable from QCD. The more rapid falloff for larger n may be thought of as due to the 
greater difficulty for the system to hold together in the scattering. 


2.13 OPERATOR DIMENSIONS AND CLASSIFICATION 


The classification of interaction terms for Hermitian scalars in Section 2.3.1 can be general- 
ized to an arbitrary field theory. In particular, in four space-time dimensions the Lagrangian 
density has mass dimension of four (so that the action is dimensionless), boson and fermion 
fields have dimensions of 1 and 3/2, respectively, and ordinary or covariant derivatives have 
dimension 1. Thus, an operator O* consisting of the product of ng boson fields, np fermion 
fields, and np derivatives will have dimension k = ng + 3np/2+ np, and the coefficient of 
OF in £L must be of the form c,/M*~*, where cp is dimensionless and M is a mass. 

Operators with k < 4 are super-renormalizable, those with k = 4 are renormalizable,°* 
and those with k > 4 are non-renormalizable (higher-dimensional). Examples include 


Super-renormalizable: o, vy, g 
Renormalizable: ("6)'0,6¢, piPy, FaF”, dvd, (p)? (2.404) 
Non-renormalizable: oy od, (Py, (Dto (D olee), (teo. 
Non-renormalizable operators often emerge as a low-energy effective theory approximation 


to a more fundamental field theory, superstring theory, etc., in which heavy fields or other 
degrees of freedom of mass of O(M) are integrated out (Section 8.2.3). 


2.14 MASS AND KINETIC MIXING 


In general, the Lagrangian density for a theory with more than one field of the same type 
(i.e., Hermitian or complex scalar, massive vector, or fermion) may include non-canonical 
kinetic energy terms and off-diagonal mass terms. For example, the Lagrangian density in 
(2.20) can be generalized for n Hermitian fields ¢,,a =1---n to 


LO) = 5 [0,87 Kog- Fmd] — Vi), (2.405) 
where ĝ = (bide vee bn)? is an n-component column vector, K is a real symmetric n x n 
matrix with positive eigenvalues, and 7? is a real symmetric matrix.*? A non-canonical 
kinetic energy term (i.e., K 4 1), which may arise if the theory descends from a more 
fundamental underlying theory, must be put into canonical form prior to quantization to 
maintain the classical relation E? = p? + m?. This can be accomplished by defining new 
fields ¢° = S‘/?OZ¢, where Ox is the orthogonal matrix that diagonalizes K and S is the 


32The term renormalizable is sometimes extended to include k < 4. Super-renormalizable, renormalizable, 
and non-renormalizable operators are also referred to as relevant, marginal, and irrelevant, respectively, 
motivated by their low energy behavior. 

33The rh? eigenvalues must be non-negative to avoid spontaneous symmetry breaking. 
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(non-orthogonal) diagonal matrix consisting of the eigenvalues of K. Then 
uP" KO" b = 8 HT SP ORKO ST PG = O,6°T OMG? (2.406) 


The mass and interaction terms must then be rewritten in terms of these rotated and 
rescaled fields, 


L(G) = 5 [2,9°T 04° — Tm] — VP), (2.407) 


where m? = S-12 OLMOS" and VP(¢°) = Vi(Ox.S—!/26°). We will see examples of 
the effects of this kinetic mixing in Chapter 10. 

The standard way to interpret (2.407) is to perform a second orthogonal transformation 
(or a first one if K was equal to J initially) to diagonalize m?, i.e., 6 = OF ¢°, where 
OT m?Om = m? and må is the diagonal matrix diag(m? m3 ---m?). Then 


6) =F > [Outa — mg] - vr), (2.408) 


a=1 


where ¢, and Ma are, respectively, the at” mass eigenstate field and mass eigenvalue, and 
Vi(¢) = VP(Om¢). The free-field propagator in (2.29) generalizes trivially in this mass 
eigenstate basis to 

bab 


Arav(k) = k2 — m2 + ie (2.409) 
in an obvious notation, so that one can calculate scattering and decay processes in a straight- 
forward way, with the index a (possibly) changing at the vertices but not along internal 
lines. 

The same considerations apply to massive vectors fields. For complex scalars the only 
difference is that the analogs of K and rn? must be Hermitian, and those of Ox and Om 
must be unitary (with T replaced by f). Such unitary transformations also apply to Dirac 
fields in the special case of Hermitian kinetic and mass matrices. Extensions involving y°’s 
(i.e., non-Hermitian matrices) and Majorana fermions are discussed in Problem 3.32 and in 
Chapter 8. 

In some cases, however, it is easier to treat the small elements (usually the off-diagonal 
ones) of m? by mass insertions, in which they are ignored in the calculation of the propa- 
gators but are instead treated as if they were interaction terms. We will illustrate this for 
the case of two Hermitian scalars Pa interacting with a single fermion w, 


L= Ly + LKE 


1 1 7 
gio — made? — mindide +Y [mei + ha] Y, (2.410) 


where Ly is the free-field density for Y and Lg Ezo is the canonical kinetic energy for Pio 
The mass eigenstates are 


ha \ ne | oh 2 cos@ sin 
( d ) j ae a OF | a moa) ¢ va) 
where the expressions for 0 and the mass eigenvalues my are elementary and will not be 
displayed. The interactions of ġa, are given by w [haba + hrdo] Y, where 


( i ) = or ( is | (2.412) 
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The amplitude for WW. —> 34 is then given in terms of the mass eigenstates by 


bee h2 h2 
M = i tigu2ti3uy zt 5 
t-m t- mi 


+(3 4). (2.413) 


Now, let us go to the limit in which mî, is a perturbation, i.e., |m?.| < m7, m3, |m? — må]. 


Then, 
2 


2 2 2 2 Miz 
AY ~ Gao a 2.414 
Ma ~ Mi, My ~ Mə, m- m?’ ( ) 
and the bracketed quantity in (2.413) reduces to 
h? h2 1 1 
H 20hıh : 2.415 
t-m? = h | =a ( ) 


Alternatively, one can treat the m?, term as a perturbation, and work in terms of the 
propagators for f > exchange. Then 


h? h2 2m2,hıh 
E EE z 1 (3 4 4), (2.416) 


t-m? t-m? (t-m?)(t-m? 


M = iP ū4uzüzu1 


where the m?, term is from the mass insertion diagrams in Figure 2.25. Expressing m?, 
in terms of 0 from (2.414) one recovers the result in (2.413) and (2.415), but (at least in 
more complicated examples) with considerably less effort. In addition to the computational 
simplicity, mass insertions allow one to keep track of small effects such as symmetry breaking 
systematically. 


Figure 2.25 t-channel diagrams for ~q2 —> v3, treating the m7, term in (2.410) 
as a perturbative mass insertion. There are additional u channel diagrams with 
(3 © 4). 


2.15 PROBLEMS 


2.1 From (A.7), the Hamiltonian for a free complex scalar field is 


n= fea (SE) (ZB) EO o + mtg], 


Show that 
Pp HA feck pp 
= Taa 2n)32E, Ep [a (P )a(p) + OB) b@)] . 


Assume that H is normal ordered. 
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2.2 The CM differential and total cross sections for the elastic scattering of a Hermitian 
scalar of mass m with potential (2.22) is given in (2.59) and (2.61). Choose m = 0.5 GeV, 
à = 0.3, and «K = 1.2 GeV. Plot the CM differential cross section do/dcos@ in units of 
fm? = 10-76 cm? as a function of cos@ for s = 5m?, 6m?, and 7m?, and plot the total cross 
section as a function of \/s for 2m < ys < 3m. Use any convenient plotting program. 


2.3 Derive (2.64) for the special case mı = m3 = 0 and mz = m4 by Lorentz transforming 
(2.57). Hint: use the fact that ø, s, and t are invariant. 


2.4 Consider the process 1*(p\)17 (p2) > 1*(p3)7 (pa), with pi Æ p3 and po Æ pa, in 
the theory of a complex scalar field with Lagrangian density given in (2.88) with 


Vi = ` lgi. 


As shown in Appendix B, the tree-level amplitude My; is given by 


(27)*5* (p3 + pa — pi — p2) Mpi = pes (psPa| — iVr(o(x), oi (£))|P1P2)- 
Calculate this explicitly using the free-field expression for ¢9, and show that My; = —iA. 


2.5 Consider the Lagrangian density in (2.84) for three non-identical Hermitian fields. 
Calculate the lowest order differential cross section in the center of mass for ġı (Pı )¢2(p2 ) > 
$1(p3 )b2(p4 ) as a function of s and the CM scattering angle 0 for the special case mı = 
mg x m3. 


2.6 Suppose that the interaction potential for a complex scalar field were 
Vr = o4 (¢* + 9") 


(rather than A(¢'¢)?/4), which is not U(1) invariant. Show that charge is not conserved 
and calculate the lowest order amplitude for n* (p1) n+ (p2) > n~ (p3) n7 (p4). 


2.7 Consider charged pion QED with a massive photon, i.e., add the term 4M3 A, A" to 
the Lagrangian density in (2.133) (with (Vr = 0). Assume M4 > 2m. (The photon mass 
term is not gauge invariant, but the model still makes sense at tree level.) 

(a) Calculate the decay rate for y — mtr in the photon rest frame at tree level for an 
unpolarized massive photon. 

(b) Calculate the ++ angular distribution dI'/dcos @ for a polarized photon, where @ is the 
angle between the photon polarization direction in the rest frame and the m* direction. 
(c) Show that one recovers the result in (a) when dI'/dcos6@ is integrated over cos 0. 


2.8 Consider n*(pP1)n (p2) > 1*(p3)77 (p4) in massive scalar electrodynamics, i.e., with 
Vi = go" dA, 


where A, the analog of the electromagnetic field, is a Hermitian spin-0 field with mass u Æ 0. 
The analog of the charge is g, which now has dimensions of mass. 

(a) Find expressions for the differential and total cross sections in the CM to lowest non- 
trivial order, in terms of s, m, u, g, and cos @. 

(b) Define the dimensionless variable x = s/m? > 4, and specialize to the values m = g = 1 
GeV, u = 0.5 GeV. Plot do/dcos@ vs cos@ in units of 1 fm? = 10~7° cm? for x = 4, 4.2, 
and 4.4. Use any plotting program. 

(c) Plot o in units of fm? versus x for the same parameter values and the range 4 < x < 5. 
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2.9 Prove directly from the defining relations that 


2m (üz u1 H iūz20"” (po — pı )vur 


2m (ti2y" yur 


H iti2a"” (po + pi)puU1 


So Oa J 


where u1,2 are two Dirac u spinors for a particle of mass m. 


2.11 Prove the identity 


ara = gP H Pg” aay gh A Pyg. 
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1 +iūz0"” (pə + pı) y’u 


2.12 Show by explicit construction that the Pauli-Dirac and chiral representations are 
related by a unitary transformation, i.e., that there exists a unitary matrix U such that 


UyepU' = ean Uupp(p, s) = uch (P, s), 


(The extra sign in the v-spinor transformation is due to a sign conventio 


2.13 The angular momentum operator for a Dirac field can be written 


; 1 gap 
Jt = EOI + orbital, 
where normal ordering is implied. Show, in the free field limit, that J? 
behavior 


ai Di pac X 
J? |b (B, s1,2)) = Ezi% (P, $1,2)), J? IY: (P, s1,2)) = 4 
where w~ and w° represent particle and antiparticle states, p is in the 2 


the orbital terms do not enter), and s = sı or s2 represent spins in the 4 


2.14 Derive the results in (2.181) for the helicity projections in the ma 


Uvpp(p, 8) = —ven(P, 8). 


n.) 


as 


has the expected 


-ZWE (i 812)) 


direction (so that 
t2 direction. 


ssless limit. 


2.15 Weak charged current transitions involve the chiral spinors uz (p,s) and vgr(p, s) 


defined in (2.203). Show that in the relativistic limit such transitions main 


ly involve negative 


helicity particles or positive helicity antiparticles, and estimate the suppression factor for 


transitions involving the “wrong” helicity. 


2.16 Show in two ways that |i(p2,+)u(p,—)|? = 2p1 - p2, where u(p, 4 
ity spinors for a massless fermion: (a) directly from the form of the spi 


representation, (b) using trace techniques. 


2.17 Prove the Fierz identity in (2.216). 


) are the helic- 
inors in the chiral 
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2.18 Suppose a fermion w of mass m interacts with a Hermitian scalar ¢ of mass u with 


Ly = hid, 


where h is small. 

(a) Calculate the spin-averaged differential cross section for w(pi)w(p2) > w(p3)v(p4) in 
the CM in terms of the invariants s, t, and u. 

(b) Specialize to m = u = 0. Show that the scattering is isotropic in that limit and calculate 
the total cross section. 


2.19 Consider e` (kı) r+(p1) 3 e` (k2) nt(p2) elastic scattering. Show that the spin- 
averaged differential cross section in the pion rest frame is 

On 

= 2 


dcos@p 9,2 ain 4 Or L 2ki 2 On |” 
2ky sin“ = |14 maa bk ag 


da Ta? cos? 


where 0z, is the electron scattering angle and we have neglected the electron mass. Hint: use 
(2.224). 


2.20 Calculate the CM differential cross section for the process e~(p;)u+t(ki) > 
e` (p2)u* (k2) in terms of s = E2j,, the CM scattering angle 6, and the muon mass m,,. 
Neglect the electron mass. 


2.21 Verify the expressions for Bhabha scattering in (2.234) and (2.235). Rewrite the final 
result in terms of s and cos ð. 


2.22 Calculate the differential cross section for unpolarized Møller scattering, e~e~ —> 
ee, both in terms of the invariants and 6. 


2.23 Calculate the spin-average differential cross section da/dcos@ in the center of mass 
for e~ (p1)e* (p2) > m~ (p3)r* (p4), and the total cross section a. Neglect the electron mass 
but not the pion mass. Ignore strong interaction effects. The angular distribution should be 
proportional to sin? @. Interpret this result. 


2.24 (a) Consider the Mott scattering process in which an electron of momentum p = BE 
scatters from a static Coulomb potential of charge Ze, 


_ Ze 
— Ar|] 


A" (zx) (1,0, 0,0). 


Show that the unpolarized (Mott) cross section for scattering angle 0 is 


da (Za)?n(1 — 6? sin? £) Zar 


dceosô 262p? sint g B<1 282p? sinf g 


The last formula is the Rutherford cross section. 
(b) Suppose the Coulomb potential for an electron in a nuclear field transformed as a scalar 
rather than as the time component of a four-vector, i.e., 


_ Ze 
~ An|z# |" 


Hr = —ep(x)b(x)o(z), g(x) 


Calculate the unpolarized differential cross section, and compare it with the Mott formula. 
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2.25 The interaction of the Z (a massive neutral vector boson in the electroweak theory) 
with a fermion f is 


L = —Gh(x)q" (gv — gay”) Y£) Zp (2), 
where G, gy, and ga are real constants. Calculate the width for Z > ff. Let Mz and m 
be the Z and f masses, and set G = 1. 


2.26 The A is a heavy spin-4 hyperon that decays into pr~ via the non-leptonic weak 
interactions. The decay interaction can be modeled by 


Lr = ty(gs — gPY)Wadbnt + h.c., 


where gs and gp are complex constants that lead, respectively, to S and P-wave final states. 
(a) Calculate the width I and the differential width dI'/dcos@ in the A rest frame for a 
polarized A, where @ is the angle between ŝ4 and the proton momentum pp. Use trace 
techniques. 

(b) Show that dI'/dcos@ is not reflection invariant for Re (gpg) # 0, i.e., that it is not 
invariant under pp > —Pp, 8a + Sq. 

(c) Repeat (a), but use explicit expressions for the A and p spinors in the Pauli-Dirac 
representation. Justify the claim that gs and gp generate S and P-wave amplitudes. 


2.27 A vector resonance V,, of mass My and width Ty couples to massless fermions a 
and b with the interaction in (F.12) of Appendix F. Calculate the total spin-averaged cross 
section for aa — bb. Assume that the propagator in (2.129) is modified to the Breit-Wigner 
form Ta 

=g" + ME 
k? — M2 +iMyly 


$ 


Dt” (k) =i | 


and express the result in a form similar to (F.11). 
2.28 Consider the interaction 


Lı =g (varvord + PorVar') 


between distinct fermions pa and Yp, where ¢ is a complex scalar and g is real. Show that 
the Lagrangian violates P and C, but is CP invariant. 


2.29 Consider e~ (k1) t+ (p1) 3 e` (k2) 7+ (pe) scattering, as in Figure 2.15. Use the two- 
component formalism of Section 2.11 to calculate the amplitudes M(—, —) and M (+, +) for 
Me = 0 and m, 4 0. Express your results in terms of a, r, and the CM scattering angle 6. 
(The two amplitudes should be equal up to a possible sign by (2.278).) 


2.30 Consider the non-relativistic limit of the matrix element (2.356) for an e~ in a static 
external field. 

(a) Compute the limit to linear order in the momenta. Hint: use the explicit forms for the 
spinors in the Pauli-Dirac representation. It simplifies the calculation to rewrite u2I"u, 
using the Gordon decomposition. 

(b) Suppose that T” (p2, pı) in (2.352) contained a term “8 q,7°G2(q?). This violates P and 
T but in principle could be generated by a new interaction. Show how G2(0) is related to the 
electric dipole moment de of the electron, which is defined by the non-relativistic interaction 
Hgpm = de: E(& ), where E is an external electric field. Note that the Hermiticity 
condition (2.354) requires that Gə is pure imaginary. 
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2.31 Suppose there is a small electric charge-violating coupling between the electron and 
a massless left-chiral neutrino vz, with 


L= —de Ay ty, Y Ype + h.c. 


Calculate the lifetime for e~ — vgy, and find the value of 6 corresponding to the limit in 
Table 2.3. 


2.32 Consider the proton matrix element t(p2)P'G(q)u(p1) of the electromagnetic current 
in (2.393), with ro given by (2.397). Calculate this explicitly in the Breit frame, in which 
0 . 
q =0, i.e., 


q= (0,0,0, VQ?), pı = (E,0,0,-/Q?/2), p2 = (E, 0,0, +V/Q?/2). 


Express the time and space components in terms of the electric and magnetic form factors 
defined in (2.398) and interpret the results. 


2.33 Let V (zı — Z2) be the potential between two non-identical spin-4 particles in non- 
relativistic quantum mechanics (NRQM). (V may also depend on their spin and momentum 
operators). One shows in time-dependent perturbation theory that the transition amplitude 
Upi from |i) = |p 51, P282) to |f) = |p353, Pasa) with mi = Mg, M2 = m4 is 


Upi = —i(27)*8* (p3 + pa — pı — p2) piel (/ aeaa $192, 


where ġ; is a two-component Pauli spinor, and V contains appropriate spin matrices. Note 
that these states are in our covariant normalization convention, which has an extra factor 
/2r)32E; ~ \/27)32m; for the it” external particle compared to the usual conventions of 
NRQM. The corresponding formula in field theory is 


Upi = (27)*8* (pa + pa — pı — p2) M, 


where M is the scattering amplitude with the phase convention of Appendix B. Comparing 
these results, we can read off the equivalent non-relativistic potential corresponding to a 
given scattering amplitude. Specifically, for 


Bi = -0 =p- 
the non-relativistic limit p — 0, |7|? < m? yields 

M > —i(2m1)(2m2) POV (d) = ~i(2m,)(2ma)ohol | PF TVE) 
Non-leading terms in p can be interpreted as the non-relativistic momentum operator. 


(a) Calculate the potential corresponding to the effective four-fermi Hamiltonian density 


Hr = Apiti Poya. 


(b) Consider the interaction 


Lr = [apiy + goo] ¢ 


between two fermions and a Hermitian scalar of mass mg. Calculate the potential between 
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pı and wz generated by ¢t-channel ¢ exchange, and show that it is attractive for g,g2 > 0. 
(c) Calculate the potential generated by 


Lr = [gidiy' + g2" p2] Vu, 


where V, is a spin-1 particle of mass My. Show that it is repulsive for gig2 > 0. 

(d) Repeat parts (b) and (c) for the case gig2 > 0, but for the potential between antipar- 
ticle 1 and particle 2 and interpret the results. Hint: it is slightly easier to use the charge 
conjugation formalism of Section 2.10. 

(e) Consider the interaction in (2.269) of a m° with protons and neutrons. Calculate the 
tree-level amplitude for p(p1) n(p2) > p(p3) n(p4) by t-channel 7° exchange, and show that 
it leads to the non-relativistic potential 


2773 =% 
gom {1, Le S/1 3 3\ _, 
V(r) = ‘On ; 
(P) l6mms 3P On 3 G x i x3 jÂ 


where Mp ~ Mn, Fp(Gn) are the Pauli matrices acting on the p(n) spin, x = m,r, and S is 
the tensor operator 
S = 3p f On: f — Op Gn. 


2.34 Suppose that a new lepton-flavor violating interaction leads to the effective interac- 
tion g 
Leff = igt Eou [A+ By] u+ h.c., 


where A and B are, respectively, magnetic and electric dipole transition moments. Calculate 
the decay rate for u > ey, neglecting me. 


CHAPTER 3 


Lie Groups, Lie Algebras, 
and Symmetries 


DOI: 10.1201/b22175-3 


Lie groups and algebras are used to describe continuous global and gauge symmetries in 
classical and quantum mechanics and in field theory. A familiar example is the description 
of rotational invariance in quantum mechanics. In particle physics Lie groups are useful not 
only for space-time symmetries such as translations, rotations, and Lorentz transformations, 
but also for internal symmetries such as isospin. It is assumed that the reader is familiar with 
such basic notions as irreducible representations (IRREPs), direct products, and irreducible 
tensor operators. Excellent introductions more detailed than the treatment here include 
(Georgi, 1999; Yndurain, 2007; Gilmore, 2005; Ramond, 2010; Barnes, 2010). Finite discrete 
groups are treated in detail in (Ramond, 2010), and more briefly in Sections 2.10 and 3.2.5. 


3.1 BASIC CONCEPTS 


3.1.1 Groups and Representations 


A group G is a set of elements g1, g2- that has 


An associative multiplication law, under which gig2 = g3 for each gi2 € G, with 
g3 € G (closure) and (g1g2)93 = g1(g293) (associative). 


An identity element J € G with Ig = gI = g for all g € G. 


1 


A unique inverse element g~! for each g € G, such that gg! = g7™tg = T. 


The elements may be discrete (with either a finite or countably infinite number) or may 
depend on a continuous parameter. An abelian (commutative) group is a special case, defined 
by 9192 = 9291 for all gı 2 € G. Otherwise, G is non-abelian. A subset of the elements that 
itself forms a group under the same multiplication law is a subgroup of G. 


(a) The set of integers n with the operation of ordinary addition is an example of an 
abelian group with a countable number of elements. The identity element is 0 (i.e., 
0+n=n+0=n) and the inverse is —n. 


(b) The set of rational numbers r other than 0 under ordinary multiplication forms another 
countable abelian group. The identity element is 1 and the inverse is 1/r. 
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n—1 


(c) The cyclic group Zn consists of the nt? roots of unity, i.e., G = {1, Wn, W2: w? 


where wn = e27*/", Z, is abelian and finite. 


(d) The quaternion group is finite and non-abelian. It consists of the eight 2 x 2 matrices 
{+I,+io*}, where I is the 2 x 2 identity and ot, i = 1,2,3, are the Pauli matrices. 
Multiplication i is defined by the ordinary matrix produce: in (1. 18) on page 4, so, e.g., 
(iot)! = —io*. The subset {+/,+ic?} forms an abelian subgroup. 


(e) The symmetric group Sn is the group of permutations of n objects. It has n! elements 
and is non-abelian for n > 2. The alternating group An is the subgroup of even 
permutations. It is non-abelian for n > 3 and has n!/2 elements. 


(f) The set of non-singular m x m matrices A forms a non-abelian continuous group under 
ordinary matrix multiplication. The identity is the m x m identity matrix and the 
inverse is the matrix inverse A~?. 


A Lie group G is a continuous group for which the multiplication law involves differen- 
tiable functions of the parameters that label the group elements. Most of the Lie groups of 
interest in particle physics are compact, which means that the parameters form a compact 
manifold (the Lorentz group, described in Section 10.2.2, is a notable exception). A Lie 
group and its multiplication law can be defined, at least for elements close to the identity 
(infinitesimal transformations), in terms of its associated Lie algebra, which consists of N 
generators (operators) T’, i = 1,2---N, and their commutation rules 


[4 Tİ] seg. T", (3.1) 


where a summation on k is implied and the Cijk = —Cjik are the structure constants of 
G. Without loss of generality, one can choose the T’ to be Hermitian, in which case the 
structure constants are real. If all of the cj, = 0, then G is abelian; otherwise, it is non- 
abelian. An element of a compact G can be represented as a formal power series involving 
the generators, by the unitary operators 


as Gow —iß. T \k 
Uc(B) = exp[-i) 6'T'] =e PT=)S > ut) l (3.2) 


where 3!--- 8% are N continuous real parameters and th e T’ are Hermitian. In particular, 
the identity element is Ug(0) = I, and the inverse of Ug(ĝ) is 


Uc(B)* = Ue(-B) = el * = Ue (6). (3.3) 
For small | B | it is sufficient to keep just the linear term in (3.2), 
Ug(8) = I — if -T + O(6:6;), (3.4) 


i.e., the generators of the Lie algebra describe the group elements close to the identity. The 
Tie algebra also defines the group multiplication law for arbitrary B. That is, 


Ue(@)UG(B) = e Te T = Ugla). (3.5) 


Aa, B ) can in principle be expressed in terms of the Lie algebra (the Baker-Campbell- 
Hausdorff construction), although there is no closed form expression in general. However, 
for small |@| and |£ | 


la- T, 8- T] + h.o.t. (3.6) 
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Now consider a set of n x n dimensional matrices L’, i = 1,2---N. If the L" satisfy the 
same algebra as the generators of a Lie algebra, 


[L*, LÍ] = icije L*, (3.7) 


then the L’ (sometimes written L’,) are said to form a representation of the algebra, and are 
Hermitian for the choice of Hermitian T’. That is, we are considering the TŻ to be abstract 
operators, while the L’ are a specific matrix realization. Similarly, the n x n matrices e~*?'4 


form a representation of the group elements Uc(f) and have the same multiplication law. 
For the compact groups one can choose Hermitian generators normalized such that 


Tr (L'L?) = T(L)6;;. (3.8) 


The Dynkin index T(L) > 0 depends on which representation is being considered and on 
an overall normalization convention, but is independent of i and j. With these conventions, 
one can show that c;,;, is totally antisymmetric in all three indices (Problem 3.2). 


3.1.2 Examples of Lie Groups 


The simplest example of a Lie group is the abelian G = U(1), with a single generator T, so 
that 
Ug(B) =T, Ue(a)Ua (8) = et". (3.9) 


There is an n = 1 dimensional representation, with L = 1 and group elements Ug(3) > 
e~*®, U(1) is named for this representation, i.e., the 1 x 1 dimensional unitary matrices 
(phase factors). 

G = SU(2) is a non-abelian group with N = 3 generators and cj;, = €ijk- SU(2) is 
named for its defining representation, the 2 x 2 unitary matrices (U(2)) with the extra 
constraint that they are special, i.e., their determinant is unity (SU(2)). The generators of 


the defining representation are Lê = +, where rê = ot are the Pauli matrices in (1.17), 


so their Dynkin index is 5 by (1.18). The ZL’ are Hermitian, so the group representation 


elements 
U(B) =e 2 = cos =I — isin cae (3.10) 


are unitary 2 x 2 matrices. Furthermore, Trt’ = 0, so they are special, 


det [e73] = eH E) — e =], (3.11) 


The adjoint representation of SU (2) is the 3 x 3 representation constructed from the struc- 
ture constants, (Lia ik = —té;;,. There are additional representations for n = 4,5--- 00. 
SU(2) is useful in nature for describing rotational invariance, the approximate isospin in- 
variance of the strong interactions, and the weak isospin gauge symmetry of the electroweak 
interactions. 

The group SU(3) plays two major roles in the standard model: as a gauge symmetry 
associated with color for the strong interactions (QCD), and as an approximate global flavor 
symmetry of the strong interactions (the eightfold way). SU(3) can be defined in terms of 
its defining representation, the 3 x 3 unitary matrices with determinant one. There are 
N = 8 generators, with (Hermitian) matrices in the defining relation given by Li = \'/2, 
where the Gell-Mann matrices \’,i = 1---8, are given in Table 3.1. The structure constants 
Cijk = fijk, i,j,k =1---8, are listed in Table 3.2. The Gell-Mann matrices satisfy 


Tr ($5) 2) si. eae de (e=) =1. (3.12) 
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There are two diagonal matrices, à? and AS, i.e., SU(3) has rank two. The AŻ satisfy the 
commutation (Lie algebra) and anticommutation rules.! 


[Ae] atta”, {A,A} = sour + 2dijp à". (3.13) 


The dijk are symmetric in all 3 indices, with the nonzero ones listed in Table 3.2. SU (3) has 
many other representations, such as a second inequivalent 3-dimensional conjugate repre- 
sentation L$. = —\A*™* /2 = —A'T /2, and the 8 dimensional adjoint (Li,4;);x = —ifijk: There 
are several SU (2) and U (1) subgroups of SU (3), such as the SU (2) associated with L!?3. 


TABLE 3.1 The Gell-Mann matrices." 


“i = 1,2,3 in the first entry. 


TABLE 3.2 The nonzero (totally antisymmetric) structure constants fij for 
SU (3), and the nonzero (totally symmetric) dijk defined by the anticommutators 
of the Gell-Mann matrices. 


I I I i 
fiz3 = 1 fs = 3 dis = yy dass = Fy ds58 = — 575 
1 1 1 1 1 
fir= 3 fss7=-3 disse =5 d344 = 5 dees = — 375 
=i = a So — 4 
fiss=-} fas =% d=} dass=3 dm=- 
1 3 1 1 1 
foas = 53 fers = v3 dow = 5 d366 = — 3 dsss = — Fy 
1 1 1 
fos7 = 3 da7 =—3 377 =—5 
1 1 
d256 = 3 dss = — 375 


3.1.3. More on Representations and Groups 


The rank of a Lie group is the number of generators that are simultaneously diagonalizable. 
The diagonal generators correspond to conserved quantum numbers if they commute with 
the Hamiltonian. U (1), SU (2), and SU (3) have rank 1, 1, and 2, respectively. 

Two n x n representations L' and L” of G are equivalent if all N of them are simulta- 
neously related by a similarity transformation, i.e., if there exists an n x n unitary matrix 
U such that 

L’ = ULU} for i=1-N. (3.14) 


Otherwise they are inequivalent. 


lt is sometimes convenient to define \° = 2/31. Then, Tr (AŻAÍ) = 25% and {4,4} = 2d A", E 
i,j,k =0,1---8, with doje = V2/38;k. 
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A representation L’ is reducible if it is equivalent to a representation 


L o 0 0 
P 0 L 0 0 
L'=| 0 0 Z4 0 (3.15) 


0 0 0 


in which each element is simultaneously block diagonal (with the same block dimensions). 
Otherwise, it is irreducible (an IRREP). States transforming according to a reducible rep- 
resentation separate into sectors not related by the symmetry, while all of the states in 
an IRREP are related. Simple Lie groups have an infinite number of IRREPs, and they 
frequently have inequivalent IRREPs of the same dimension. 

A fundamental representation is, roughly speaking, a representation from which the 
others can be generated by direct products, in analogy to the way that any angular mo- 
mentum j in quantum mechanics may be generated by combining 27 angular momenta L, 
The defining representations (m) of SU(m), such as the 2 of SU (2) in the example, are 
fundamentals. 

The adjoint or regular representation of a Lie group is the N x N dimensional represen- 
tation constructed from the structure constants, 


( adj) jk = —1Cijk: (3.16) 


It is straightforward to show that Lia satisfy (3.7) (Problem 3.3). The adjoint is essential 
for defining the self-interactions of the gauge fields in a non-abelian gauge theory. 

If Lİ is an n dimensional representation of a Lie algebra, then the conjugate L',. = 
—L'* = —L'T is also a representation. Ln is real? if it is equivalent to Li», i.e., if there 
exists a unitary U such that —Li* = UL,U' for i =1---N. Otherwise, it is complez. The 
adjoint representation Lia; is always real, with U = I. The 2 of SU(2) is real, i.e., 

gor 2 T? 2 
Lo» = Fan (3.17) 
so that U = 7?. The higher-dimensional SU (2) representations are also real. On the other 
hand, the m of SU(m) for m > 2 is not equivalent to the m*, which is also a fundamental 
represention. For example, L3» = A> in SU(3) is not equivalent to L3 = x. This is 
important for the Higgs Yukawa couplings in extensions of the electroweak SU(2) group to 
higher symmetries. 


The Simple Lie Groups 


Two groups G; and G2 commute if [g;, ĝ;] = 0 for all g; E€ Gi, 9; E Go. Then, one can 
define the direct product group G = G1 x Gz with elements giĝj, or direct products of more 
than two groups, such as the standard model group SU(3) x SU(2) x U(1). A simple group 
is (non-rigorously) a non-abelian group such as SU(3) that is not a direct product.? A 
semi-simple group is basically a direct product of simple groups, i.e., a Lie group with no 
U(1) factors, such as SU(3) x SU(2). 


Mathematics books typically work in terms of iL, motivating the term “real”. 

3More precisely, a subgroup H of a Lie group G is an invariant subgroup if ghg—! € H forallg € G,h € H. 
G is simple if it contains no invariant subgroups (other than the identity and G itself), and semi-simple if 
it contains no abelian invariant subgroups. Compact semi-simple Lie groups are either simple or the direct 
product of two or more simple groups. 
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Cartan has given a classification of the simple Lie algebras. The classification as well 
as the IRREPs and their properties are elegantly derived from Dynkin diagrams (Slansky, 
1981), but here we only give the results. There are four countably infinite series of classical 
Lie algebras, and five exceptional algebras, as listed in Table 3.3. The four series correspond 
to simple matrix conditions for the defining representations of the associated groups: 

SU(m),m = 2- - - o0, correspond to the m x m complex unitary matrices Ug = e774 
with unit determinant, 
UGU} =I, detUg = 1, (3.18) 


which implies that B - L are the traceless Hermitian matrices.* Ug leaves invariant 
the inner product of two m-dimensional complex vectors, i.e., ya = y'ta’, where 
C= Uha and similarly for y’. 


SO(m) are the m x m real orthogonal matrices Og with unit determinant? (i.e., 
rotations in an m-dimensional real space) 


OcOG =I,  detOg=1, (3.19) 
so that Og = e FL with iB - L real and antisymmetric. The inner product yT a of 
two real vectors is left invariant under an Og transformation. 

Sp(2m) are the real 2m x 2m symplectic matrices M, defined by 
M'SM =S, (3.20) 


Om Im 
respectively, the m x m zero and identity matrices. They therefore leave invariant the 
quadratic form yT Sx, where x and y are 2m-dimensional real vectors. 


where S is the skew symmetric matrix S = , where Om and Im are, 


The defining representations of SU(m) and S'p(2m) are also fundamental and can be used 
to generate the higher-dimensional IRREPS as direct products. For SO(m) one can derive 
higher tensor representations (including the adjoint) from the defining or vector (m). How- 
ever, there are additional double-valued fundamental spinor representations, similar to the 
familiar 2 of SO(3) ~ SU(2) (see, e.g., Li, 1974; Slansky, 1981). All of the IRREPS can be 
generated as direct products of the fundamental spinor. SU(m), SO(m), and some of the 
exceptional groups have found considerable application in physics. Recently, Sp(2m) has 
emerged in connection with string theory. 


Casimir Invariants 


A Casimir invariant is a function f(T) of the group generators T’ that commutes with them, 
(f(T), T] = 0. By Schur’s lemma the corresponding function f(L) of an n x n IRREP L 
is a multiple of the identity. The coefficient may depend on the representation and may be 
used to label it. The simplest example is the quadratic Casimir 


N 
P=) LL = (L). (3.21) 
i=l 


4The U(m) group (which is not simple) is related by U (m) = SU(m) x U(1), where U(1) is defined in 
(3.9) with T the m x m identity matrix. 

5The orthogonal group O(m) consists of the transformations Og and ROg, where Og € SO(m) and R, 
which represents a reflection in an odd number of dimensions, is a diagonal matrix with elements +1 and 
det R = —1. 
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TABLE 3.3 The Cartan classification of simple Lie algebras.“ 


Cartan label Classical group N Range 
Ag SU(é+ 1) L(L + 2) €>1 
Be SO(2é+ 1) £(20 + 1) L>2 
Ce Sp(20) £(2€ 4+ 1) L>3 
De SO(28) £(2@ — 1) l>4 
Go 14 
Fy 52 
Eg 78 
Er 133 
Eg 248 


"The groups SO(6) ~ SU(4), SO(4) ~ SU(2) x SU(2), SO(3) ~ SU(2) ~ Sp(2), Sp(4) ~ SO(5), 
and SO(2) ~ U(1) have the same Lie algebras but may differ for non-infinitesimal transformations. The 
subscript in the first column is the rank. 


A familiar example is J? = j(j + 1)I for the angular momentum j representation of the 
rotation group. C2(L) is related to the Dynkin index T(L) defined in (3.8) by 


T(L)N = C3(L)n. (3.22) 


The quadratic Casimir of the adjoint T(Laqj) = C2(Laaj) is also written as C2(G). From 
(3.16), 

CiklCjkl = Co(G)d;;. (3.23) 
The quadratic Casimirs and Dynkin indices for the defining and adjoint representations of 
the classical Lie algebras are given in Table 3.4 (see also van Ritbergen et al., 1999). Other 
useful identities (which can be used to construct other invariants) are 


Bo So 1 s 
DTI = |C2(L) - 5C2(G)| D 

(3.24) 
Cijk LLI = -CO(Q)LF, 


from which Tr L* = 0 for the generators of a simple Lie group. 


TABLE 3.4 Quadratic Casimirs and Dynkin indices for the defining 
representation Ln and adjoint representation of the classical Lie algebras. 


G N C2(G) n T(Ln) C2(Ln) 
wA = a a 
SO(m) fi) 2(m — 2) 9 i 
Sp(2m) m(2m + 1) m+1 2m 1 2m+1 


More on SU(m) 


Properties of the SU(m) IRREPs and their direct products can be found systematically 
from the Young tableaux (e.g., Cheng and Li, 1984; Patrignani, 2016) or the more general 
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Dynkin methods. However, many aspects of SU(m) are simple enough to “do it yourself.” 
For example, the fundamental Li, = \*/2 with Tr (\'\2) = 26% can be written as an 
obvious generalization of the 3 x 3 matrices in Table 3.1, and the structure constants can 
be calculated from them. An important property of SU(m) (that does not generalize to the 
other simple groups) is that the L$, along with the identity form a complete set that can 
be used to write any m x m matrix, 


m?-1 
1 1 fos 
M=—Tr(M)I+ = Tr (MXX 2 
rt rs 2 r (MAJA, (3.25) 
where the coefficients are generally complex. In particular, the anticommutator {X’, AJ } is 
a linear combination 


4... 

iA eh eo tt Qdijnr*. (3.26) 

where the totally symmetric dij generalize those in Table 3.2. This allows one to generalize 
the SU(2) Fierz identity in Problem 1.1 on page 5 to SU(m), 


2 dys) tna), (3.27) 


OÅ Ax) O4 xı) = 2ne(xa)ordxs) - = 


where nr = +1 if the x; are m-dimensional complex vectors or complex scalar fields and 
nr = —1 for anticommuting fermion fields. 

SU(m) tensor methods, discussed in Section 3.2.3, are especially useful for constructing 
SU(m) singlets from direct products. 


3.2 GLOBAL SYMMETRIES IN FIELD THEORY 


3.2.1 Transformation of Fields and States 


In field theory, groups consist of symmetry operations that leave the equations of motion 
unchanged in form. These may be discrete symmetries, such as P, C,T discussed in Section 
2.10, or discrete internal symmetries, which will be considered in Section 3.2.5. Here we are 
more concerned with continuous groups. One important class is the space-time symmetries, 
such as space rotations, Lorentz boosts, and translations. Another, considered in this section, 
is internal symmetries, involving the interchanges of fields with similar properties, changes 
in their phase, etc. To formalize this, let ®,(x),a = 1,2---n, be n fields (which may be 
spin-0, 4, 1, etc.) related by a symmetry. Furthermore, consider a Lie group G of operators 
Ug(B) = e-®T Then, define a set of n transformed fields 
p, =e PTE (xe tT 
Gul 
2! 


where the last form follows from the operator identity in Problem 3.5. Thus, the transfor- 
mation of the fields is determined by their commutators with the group generators T’. Since 
we assume that the ®, are transformed into each other, one must have that 


[T*, Ba (x)] = —Li,80(z), (3.29) 


B-T, 8- T,@a] t, (3.28) 


where L}, are the components of an n x n matrix L’, which is easily shown to form an n x n 
dimensional representation of the Lie algebra of G. From (3.28) and (3.29) one has that 


D, = (ct 4) as = U(B)as® o Pa + il- Labo, (3.30) 
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which defines how the n fields corresponding to representation L are transformed into each 
other. One usually considers the case that L is irreducible. 

There is a frequently useful matrix notation for fields AŻ transforming under the adjoint 
representation? (3.16), in which the A’ are reexpressed in terms of the elements of an n x n 
matrix 

— iri ;_ Ir(AL) 
a ee >A TL)” 


(3.31) 


where L’ is an arbitrary non-trivial IRREP of dimension n (usually taken to be the fun- 
damental or defining). It is then easy to show (Problem 3.7) that the transformation of 


A = At = (ci Laas), A! can be expressed in terms of representation L by 


N EEn as , Iri 
A> A! = XO'L E etib L Ae th Alt = Tr (A L ) 
= oe) 


(3.32) 
As described in Chapter 2, if ®, corresponds to a particle, then the antiparticle field 
is given by or closely related to ®t. There are two possibilities for the transformations of 


non-Hermitian fields. One is that the fields for the particle and antiparticle are in the same 
IRREP, such as the pions 


S=| x |, (3.33) 


which transform as a triplet under SU(2) isospin. This requires that the representation is 
real, such as the adjoint in this example. Alternatively, the particle and antiparticle fields 
can be in different IRREPs, such as the kaons under isospin 


+ = 
o=( jo } ot=( g) (3.34) 
Then, if ® transforms under the n representation Lê, ®t transforms under the conjugate 
representation Li. = —Li* = — LiT, which follow by taking the adjoint of (3.29) and using 
that T’ is Hermitian. Of course, Lt, may be real, as in the SU(2) example or for the adjoint 
of SU(3). 
From the expressions (2.93) or (2.159) on pages 23 and 35 one sees that for free fields 
the single particle state corresponding to ®, may be constructed by 


Ja) = al]0) ~ ©$10), (3.35) 


where in the second expression it is understood that a Fourier transformation and appro- 
priate projections of Dirac spinors, etc., are to be performed. Thus, the states |a) and ®t 10) 
transform the same way under G. This continues to hold for interacting fields as long as 
the TË commute with the Hamiltonian H. Then, the action of the generator on the state is 


T'\a) ~ T’dt|0) = Ot T*|0) + LiF! lo). (3.36) 
Assume for now that the ground state is invariant, i.e., T’|0) = 0. Then, 
T'|a) = |b) Liq = |b) (b|T"|a), (3.37) 


so that the representation matrix Li, = (b|T"|a) is just the matrix element of T’ in the 
n-dimensional space of particles. 


6The adjoint representation is real, so there is no distinction between upper and lower indices, i.e., 
At = Aj. 
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3.2.2 Invariance (Symmetry) and the Noether Theorem 


> 


The Lagrangian density £ is invariant or symmetric under a group of transformations Ug(8 ) 
if they commute, i.e., if in E 
L' =Ug(8)LUG(B)* = £ (3.38) 


for all 8 . (A similar definition applies for invariance under discrete transformations.) Since 
psg [2 f, c] (3.39) 


for small ||, invariance holds if and only if 
[T*, £] =0 (3.40) 


for all i. 

The first part of (3.38) defines the transformation of £ whether or not there is an exact 
symmetry. Since £ is a function of (®,,0,,®q), and (for a non-Hermitian field) of (®t, 0,61) 
one has that 

L’ = Ue (B)LUG(B)* = L(a, 0,0, OF, 0,87), (3.41) 
where 7, is given in (3.30), with an analogous expression for 0,,®/, (since we are considering 
global transformations, B = constant). The expressions for ®/' and 0,,/' are similar except 
that L’ > —L"™. It is frequently useful to consider explicit symmetry breaking, i.e., 


ÔL = L' — L £ 0 (but small). (3.42) 
One can also have spontaneous symmetry breaking 
L'=CL but T"|0) £0, (3.43) 


i.e., the Lagrangian is invariant but the ground state breaks the symmetry (cf., the breaking 
of rotational invariance in a ferromagnet). Both of these cases will be considered extensively 
below, but for now consider an exact symmetry, 


2) = 0 and 70) = 0. (3.44) 


This implies degenerate multiplets of particles and definite relations between their inter- 
actions. It also implies conserved currents and charges according to the Noether theorem, 
which generalizes the result for a single complex scalar field discussed in Section 2.4.1. The 
Noether theorem for internal symmetries is 


, d 
H = —ỌQ' = 
3p" =0, g =0, (3.45) 


where the Noether current and charge are 


ee ae BE 
= ‘snp, “ave EES 


(—Lis) of, (3.46) 


= / PI (tz). (3.47) 


Normal ordering on the fields is implied. The Noether theorem can be derived from the 
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Euler-Lagrange equations, in analogy with the derivation for the U(1) case in Section 2.4.1. 
One can use the canonical commutation rules to show that 


(Q, Q] = ici", — [Q", Ba(x)] = -Laode (2). (3.48) 


That is, one can identify Q’ = T* as a concrete construction of the generators in terms of 
the fields. 

The Noether currents are also useful for explicitly broken symmetries. The Noether 
charges are no longer time independent, but one can use the canonical commutation rules 
to show that the commutation rules in (3.48) still hold provided the charges and fields are 
evaluated at equal times. For example, suppose £L = Lo + £1, where only Lo is invariant, 


[T*, £o] = 0 (all i), [T’, £1] #0 (some i). (3.49) 
Then, the change in £ is related to the divergence of J, 
ôL=L' -L= |-i8- T, £| =- J, (3.50) 
This immediately implies 
angi =i [TL] > / dz ors: = —i [T', H], (3.51) 
where the last step assumes that any symmetry breaking is in the mass and interaction 


terms (i.e., that kinetic energy terms are canonical). The (non-conserved) T’ and 0" J}, are 
evaluated at the same t. Equation (3.51) implies that 


(alpa) |3" Fjlb(Pe)) 


(a(pa)|T' lpo) = i(27)P 8 (Pa — Bh) 


(3.52) 


This relation is useful when states a and b are not related by the symmetry and are not 
degenerate in the symmetry limit. One then has that the leakage of T’|b} into |a) is pro- 
portional to the symmetry breaking. 


The Complex Scalar 


As a first example, consider an IRREP relating n complex scalars ¢g,a = 1---n, trans- 
forming as |[T*, ġa] = —L*,¢, under some Lie algebra that will be determined from the 
symmetries of the Lagrangian density, 


p= 5 (Ouba)! (3 pa) + non-derivative terms. (3.53) 
a=1 


The Noether currents are = 
Ji = ip} Lin On Po; (3.54) 


where fo, g = f (Ong) — (Ou f)g. The derivative (kinetic energy) terms are invariant under 
the group U(n) = SU(n) x U(1). Under an SU(n) transformation 


o> elTe, g + gte PE, (3.55) 
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where ġ is the n-component column vector (¢, $2::-¢n)", ¢' is the row vector 
(¢| o4--- 1), and L’ is the fundamental representation matrix Li, of SU(n). The SU (n) 
invariance is obvious with this matrix notation 


Lre = (3 p (86) > (3,0) et Eet (a4) = (0,0)! (04). (3-56) 


Lee is also invariant under U(1) transformations, ¢ > e’°/¢. U (n) is the maximal possible 
symmetry group of the system; depending on the mass and interaction terms the symmetry 
may be smaller. 
Including mass terms 

L= (8,6)! (IG) — Pu’, (3.57) 
where $176 = ¢t u2,¢» and u? is an n x n matrix with elements p2,. The Hermiticity of £ 
requires that u? is Hermitian. The eigenvectors and eigenvalues of u? correspond to states 
of definite mass and to their mass-squares, respectively. For now, however, let us assume 
that u? is already diagonal, 4? = diag (uf u3- -- u7). Under a group transformation, 


Puo p'e PE petig, (3.58) 
so the requirement for invariance is that 


eT L 2 eHiBL = pe (3.59) 


for all B , which is equivalent to 
|. Ž, p| =0. (3.60) 


Equation (3.60) determines what subgroup of U (n) survives. For example, if all of the masses 
are the same, u? = p?I, then £ is invariant under U (n). If all the masses u2 are different, 
then the symmetry is reduced to 


U(1)” =U(1) x U(1)2 x +++ x U(1)n, (3.61) 
where only ¢, transforms nontrivially under U (1)a, i.e., 


ba > Pepa, by > Q for b £a. (3.62) 


For the intermediate case of 


p? = diag | pie pi pa 13 (3.63) 
= TEES a 
ny ng 
with nı fields of mass pı and nz of mass u2, the symmetry group is U(n1) x U(n2), with 


the first (second) set of fields transforming under U (n1) (U (n2)). 
One can also add quartic interaction terms, 


L= (up) (OHH) — HRP- SO Andean plea, (3.64) 


abcd 


where Až ea = Abade (from Hermiticity), and Bose symmetry projects out the parts of Aabca 
that are symmetric in ac and in bd. In general,” this reduces the symmetry to U (1), but there 


"Even the U(1) would be broken in the presence of terms like ¢4 + 614 or (6+ ot)dt¢. 
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could be a higher symmetry for specific A’s. For example, full U(n) invariance is restored 
for 


Aabed = No abSed: Ve = fel 


irau oa ad (3.65) 
=> L= (0,6)! (06) — uP e- A (p4). 
The Hermitian Scalar 
Hermitian scalars, Øa = Ø}, transform as 
[T*, ba] = —Lirhv. (3.66) 
Consistency requires that L’ = —L™, i.e., the representation is real. The Lagrangian density 
1 
L= 5 (3 pa) (I Pa) + non-derivative terms (3.67) 
implies Noether currents 
Ti = ~i (Ouba) Lavo (3.68) 
(the second term in (3.46) is absent for Hermitian fields). The special case 
_1 22 2 
L= 2 (dupa) =H PaPa —A (Papa) (3.69) 
is O(n) invariant (Problem 3.8). However, 
1 2 1 2 
L =-= (Ona) — =ball 
5 (Oupa)? — 5 batiands con 


E KabcQa bpe = XabcdPabo Pca 


has no symmetries at all in general. 


Complex Scalar in a Hermitian Basis 


A complex scalar field ¢ can always be written in terms of two Hermitian scalars as in 
(2.91) on page 23, ¢ = (dx + iġr)/ V2, where or, are Hermitian. For complex fields that 
are in the same representation as their adjoints, such as in (3.33), it is almost always 
simpler to rewrite the theory in terms of Hermitian fields. In the case such as (3.34) that 
the complex fields and their adjoints transform separately, it is still sometimes useful to go 
to a Hermitian basis (especially for formal manipulations), although the complex basis is 
usually simpler for explicit calculations. The relation between the bases is straightforward, 
but can be confusing. 

Suppose ¢ is an n-component complex field transforming with representation matrices 
L}. It is then convenient to introduce the complex 2n-component vector ® = i 


half of the components redundant. ® transforms under the reducible representation 


; Li 0 
: o , 
o= ( 0 _Lis ) ; (3.71) 


) , with 
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One can introduce 2n Hermitian fields daR, Qar, by da = ya (bar + idar), and the 2n- 


component real vector ¢, = ( - | The two bases are related by the unitary transfor- 
I 
mation i 
I äl 


where J is the nxn identity. Hence, the representation matrices for the symmetry generators 
in the Hermitian basis are 


. : 1 Li — Li* (Lt + Li*) 
L aA AS ( i ee ee J]; 3.73 
on Pe 2N illi +) Li 1 (aa) 


which are manifestly imaginary and antisymmetric for Hermitian L}. 


Examples: (a) Consider the U (1) group acting on a single complex ¢ > exp(+i8)¢, so that 
L¿ = 1. A and the representation matrices are therefore 


1 1 a 1 0 0 2 


while a finite U(1) transformation is just a rotation 
OR -ipr? { or \ _ ( cos -sinp OR 
( or) °° ġr }) \ sing  cosß r jJ te} 


+ 
(b) Consider two complex fields ¢ = ( fo ) transforming as a doublet under SU(2), 


L’ = 7'/2. (The superscripts look ahead to applications to the Higgs.) In the (reducible) 
4-dimensional Hermitian basis ¢, = (¢1 3 ¢2 p4)”, where d+ = (¢1 + id2)/V2 and 
$? = (63 + iba) /V2, the representation matrices are 


1,3 a 0 ris 2 1 7? 0 
Lg = z ( _713 0 ) 5 L h = 5 0 72 . (3.76) 


Fermions 


Now consider the case of n fermions Ya, with 
LS Wai Da = WaMarr = wi Pw = omy. (3.77) 


In the second form, 7 is the n-component column vector (Y1 Y2- Yn)? and m = mi is an 
n x n Hermitian matrix® that can be taken to be diagonal. 7 and y transform as 


[T va] = -Listo [T] = +i (3.78) 
under a symmetry transformation, and the corresponding Noether current is 


Ji, = PaVa Linto: (3.79) 


8Generalized fermion mass terms involving y% or, equivalently, non-Hermitian matrices, are considered 
in Problem 3.32 and Chapter 8. 
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As in the scalar case, the symmetry group is determined by £. One has 
L> L = pipe Pe etiblLy — pe- Emeti P Ey, (3.80) 


The kinetic term is invariant under U (n) = SU (n) x U (1), but the invariance condition for 
the mass term is 2S CINN = 

e PB lmettPE =m a [3 . E, m] =0. (3.81) 
The full U (n) is maintained for n degenerate masses, m = mj I, while the symmetry is 
reduced to U(1)” for n distinct masses. 


3.2.3 Isospin and SU (3) Symmetries 
SU (2) Isospin 


Isospin is an approximate symmetry of the strong interactions, broken by ~1%. The break- 
ing is ultimately due to the u — d quark mass differences. This is usually viewed as intrinsic 
to the strong interactions when discussing QCD, though the masses are actually associated 
with the electroweak sector. There is a comparable breaking from electromagnetism. We 
first describe a simple model of isospin in terms of the nucleons and pions. Introduce the 
nucleon (proton and neutron) and pion fields 


Y ™ nı #179 
y= ( a ) ; r=| m |, ct = —_—, n° = T3, (3.82) 


where m; = nt and 7 annihilate the states |7*). ~ and m transform as a doublet (funda- 
mental) and triplet (adjoint), respectively,? under SU(2), 


Li, = - (Li) 4, = iisk- (3.83) 


The diagonal generator is T? (L3 is diagonal in the 7*° basis). Consider the Lagrangian 
density 


Lo = $ (i -mI tig R-P) Y+ a lo z}? - Pr?) -A (z?) (3.84) 
=J (i J-mI+ i271) wt sik [(a.t)? z pT] —\(TrIP)’, 


mn T= at 
with 7? = S n? and I = 7 = — Ta) . The gr term is the Yukawa interaction 
V2 
between the pion and nucleon.'? The second (matrix) form in (3.84) makes it especially 
easy to read off the Feynman rules for the ppr°, nnx°, npr, and pnrt vertices, namely, 
=g, +9n7°, —V29n7°, and —V/2g,7°, respectively. 
We should emphasize that experimentally g» is very large: the experimental 7 N coupling 


The convention for the 7+ fields in (3.82) is common in particle physics, and convenient because 
nt = (x—)t. However, the Condon-Shortley phase conventions usually employed for states in rotational 
multiplets in panum mechanics (and in standard Clebsch-Gordan coefficient tables) would instead require 
the convention m+ = + (m1 F in2) /V2 (Problem 3.12). 
10We use the cern Yukawa interaction in a generalized sense, i.e., for any 3-point interaction between a 
spin-0 and spin-4 particles. 
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Gr observed from low energy 7N and NN interactions, which may differ from g, by strong 
interaction corrections, is ~ 13.05(8) (e.g., Gorringe and Fearing, 2004). Therefore, (3.84) 
should be viewed as a model to illustrate symmetry considerations and not as a serious 
perturbative field theory for the strong interactions. Lo is SU(2) and reflection invariant 
(the y° is because the pions are pseudoscalar); using (3.32), 


Nja 


yr etib Sy, yr pe i Tl eeb? eS, (3.85) 


from which both wy°Iy and Tr H? are invariant under SU(2). The invariance of the pion 
self-interaction is further discussed in Problem 3.15. 
Now, introduce SU(2) breaking by £ = Lo + £1, where 


Ly = —ebrsp = —e (Ypbp — Yntn) - (3.86) 
Lı thus represents a splitting between the proton and neutron masses, 
Mmp=m+e My, =m—e, (3.87) 


which parametrizes contributions both from the quark masses and from electromagnetism.!! 
It is straightforward to show that 


[T?, £1] =0, [T*?,£i] #0. (3.88) 


Thus, SU (2 ) is broken to U(1)rs, corresponding to a conserved charge T? with values 
#3, = +1,43, = 0, and t} = —t} = 1/2. Actually, £ is invariant under U(1)rs x U(1)z 
where the second U(1) dorson to a conserved fermion (or in this case, baryon) number 
Bp,n = 1,B, = 0. The conservation of B and T? is equivalent to that of B and electric 
charge Q, where Q = T? + B/2 when restricted to the fields in this example. £; transforms 
as the T = 1,T’ = 0 component of an irreducible tensor operator. One can therefore use 
the Wigner-Eckart theorem for relations between its matrix elements, in exact analogy to 
broken rotational invariance in quantum mechanics. 


SU (3) Symmetry 


SU (3) is an approximate global symmetry of the strong interactions that extends the SU (2) 
isospin subgroup. It is valid at the ~ 25% level for masses, but works better for relations 
between couplings and matrix elements. SU(3) was proposed independently by M. Gell- 
Mann and Y. Ne’eman in the early 1960s to account for the fact that the low lying mesons 
and baryons could be associated in octets (the eightfold way). As described in Section 
3.1.2, SU(3) has eight generators. The fundamental representation matrices L$ = \'/2 and 
structure constants are listed in Tables 3.1 and 3.2. SU(3) has rank 2, so two generators, 
T? and T’, can be simultaneously diagonalized with the Hamiltonian. Empirically, these 
correspond to the strong interaction quantum numbers by the Gell-Mann-Nishijima relation 


X 3 2 fs 
=I a Iz = T”, Y= T 
Q 3 2 3 V3 


where Q is electric charge, [3 is the third component of isospin, Y = B + S is strong 
hypercharge, B is baryon number, and S is strangeness (S = 0 for the pions and nucleons). 


(3.89) 


11 For simplicity we ignore isospin breaking terms in the interactions or from pe 4- g? “0° The latter are 
easily shown to come only from electromagnetism to leading order, and not from the quark mass differences. 
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The low-dimensional representations of SU(3) are the 1,3,3*,6,6*,8,10,10*, and 27, 
where the 1,8 (adjoint), and 27 are real and the others complex. The observed light hadrons 
can be assigned to the 1,8,10, and 10*. It is convenient to display the states on weight 
diagrams, with the axes corresponding to J3 and Y. Then, the other generators of the 
Lie algebra describe transitions from one state to another. For example, the lowest lying 
JS 15 baryons!? and J? = 07 mesons (J is the spin and P is the intrinsic parity) both 
transform under the adjoint (octet) representation, as shown in Figure 3.1. In the absence of 
SU(3) breaking, the states in each octet would be degenerate. Each consists of two isospin 
doublets with Y = +1, and one isotriplet and one isosinglet with Y = 0. The baryon fields, 
which annihilate states with B = 1 and strangeness S = Y — B, are given by 


ae 2 (yı F ipo), DY = Ws, A= Ws 
1 1 
5 (Ya — tvs), = (Ye — ip7) (3.90) 
B= loti), B= (Ya tits), 
while the pseudoscalar meson (B = 0) fields are 
oe (61 + iġ2), n° = $3, n = ĝs 
; (3.91) 
K+ = Z (ġa Fis),  K°(K°)= a (%6 F iġ7). 


The 7, which also has Iz = y = 0, is an SU (3) singlet. The 7, K, n,n’ system is referred to 
as a nonet = 8 + 1. 


-0 -+ 
Ma Vii Pe KY YX 1 e K 
Da EA Dr T T? N at 
——— —+ hs 
= =0 = 70 
e 4—1 e` Ky —-l o K 


Figure 3.1 Weight diagrams for the J? = Le baryon and J? = 0~meson octets. The 
anti-baryons are in a separate octet, while the mesons and their antiparticles are 
in the same octet. 


Analogous to (3.82) one can define 8-component vectors y and ¢, so that y > 


12A baryon (meson) is a hadron with half-integer (integer) spin. A hyperon is a baryon with nonzero 
strangeness (but no heavy quantum numbers such as charm). 
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exp (iB - Laaj dh, o > exp (ip - Loa)? under an SU(3) transformation. The extension of 
(3.84) to the baryon and pseudoscalar meson octets is 


- 1 e 
Lo =Y; (iØ — m) pi + z [ond — 65) + igs fije Vit Vy oe 
+ ga dijk Pity ibe — (05)? — B dijk dimn $j >nbmOns 


(3.92) 


where p? = 6); and fijk and dijk are the coefficients given in Table 3.2. There are two inde- 
pendent SU (3)-invariant meson-baryon interactions, known as the “F” and “D” couplings, 
which are, respectively, antisymmetric and symmetric in the SU (3) indices. This can be un- 
derstood as follows. An invariant coupling is just a singlet component of the direct product of 
the fields in the interaction term. For SU (2) the meson-baryon interaction in (3.84) involves 
the direct product of two doublets and one triplet, (2 x 2) x 3 = (1+3) x 3 = 3+[1 +3 +5], 
where the IRREPs are labeled by their dimensionality 27 + 1. The singlet only occurs once 
in the decomposition, so there is only one invariant. For SU (3), however, there are two 
independent ways to form an invariant from 8 x 8 x 8, 


8x8=1+8+8+10+10* +27 


8 x (8 x 8) = 8 x 1+ (8 x 8) + (8 x 8) +8 x [10 + 10* + 27]. (3.93) 
aways s 
Deze Tp 


Similarly, there are now two invariant meson self-interaction terms, associated with the 
singlet and symmetric octet components of 8 x 8. 
Lo can be written in matrix notation, using 


yn + + 
8 No; V2 T V6 se K 
= Us = =m ge 
a lor a T zta ef (3.94) 
i=1 K- K? A 
as well as o o 
Sea AL + 
D Aii A E a 
B= t= E- = + = n (3.95) 
i=l v2 z- wes ve —2A° 
= = v6 
and a. ad 
>») A yi = 
= $ NU; EG = A? = 
B= = s a t a a (3.96) 
1= p n “Ja 
Therefore, 
_ 1 _ 
Lo =Tr [B (i - m) B] + 51 (a.m) £ yom?) F an (Bix? [B, M]) 
(3.97) 


+ ae (Biy? {B, M}) — (a- ONT M*)’ — 28 Tr M*, 


which is invariant from (3.32). 
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SU (3) Breaking 


The degeneracy of the SU(3) multiplets is only good to around 25%, though the predictions 
for couplings and amplitudes are typically much better. The Gell-Mann-Okubo (GMO) 
ansatz is that the breaking can be described by an operator that transforms as an octet, i.e., 


L=Lot+egls, (3.98) 


where Lo is a singlet (i.e., invariant), €g is a small coefficient, and Lg transforms as the 8°” 
component of an octet of operators, £;,2 = 1---8: 


Uotila =i Veku = (ers) i; (3.99) 
J 
or equivalently, 
[T*, Lo] =0, — [T", £8] = — (Laai) g; £3 = tfias Ls (3.100) 


When first postulated, the actual form of £g was not known, only its transformation prop- 
erties. The power of such an ansatz is that it allows matrix elements of Lg to be related by 
SU (3) in terms of one or more parameters that can be measured (or calculated in a more de- 
tailed theory). To illustrate this, recall the Wigner-Eckart theorem for SU(2), which relates 
the matrix elements of an irreducible tensor operator!® TË, which carries angular momen- 
tum (or isospin) k and z- component q, between states a; and a2 with angular momenta 
and z components j1,2 and m 4,2: 


(a2 ja ma|Ty | jı mi) = (az jo IIT" || ar ji) (j2 Mmo | k q jı ma). (3.101) 


The double-barred quantity is the reduced matrix element, which depends on the operator 
and states, but is independent of m1, M2, and q, while the second quantity is a Clebsch- 
Gordan (CG) coefficient, which leads to selection rules, mz = q + mı and |k — jı| < 
jo < k + jı, and relations between the nonzero matrix elements. [An excellent table of CG 
coefficients can be found in the Review of Particle Properties (Patrignani, 2016, or their 
website).] Similarly, for SU(3) the matrix element of an octet operator between two octets 
can be written in terms of two quantities that depend on the dynamics and the f and d 
symbols, which are analogs of the CG coefficients.'+ Thus, 


—es(i|Ls|j) = a ifisi + Bdis;, (3.102) 


where a and ĝ are proportional to eg. The symmetry-breaking term in the Hamiltonian is 
—eg | d?Z L(x), implying that the shift in the mass mg, of baryon B, due to the SU(3)- 
breaking is 

(B,| — eg£8|B,) 


Ame, ~ 
2MB 


, (3.103) 
where the 2mp, in the denominator is from our covariant normalization convention. There- 
fore, 

MB, = MB, + CMa + camp, (3.104) 


r 


where mg, is a common (SU(3)-invariant) mass, c% q are the “CG” coefficients obtained 
from if and d by taking the appropriate linear combinations of indices, ma ~ a/(2mz,), 


13(73, TE] =qTy, [T1 + iT?, TF] = Vlk F a) (k +q +1)TE. 
14One can generalize the CG coefficients to arbitrary SU(3) representations using isoscalar fac- 
tors (de Swart, 1963; Patrignani, 2016). 


= 
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and similarly for mg. Ignoring isospin breaking, there are four masses, My = (Mp + Mn)/2, 
Ms, Mz, and Mj, which can be expressed in terms of three parameters, mB, ,a,g. The latter 
cannot be predicted a priori, but there is one linear relation, the Gell-Mann-Okubo relation 


(Problem 3.20) 
Ms+M 3M, + M; 
st N- it ZiO (3.105) 
where the result holds to leading order in es. Experimentally, the GMO relation works 


extremely well. The individual masses in GeV, 


My ~ 939, Mz ~ 1318, Ms ~ 1193, Ma ~ 1116, (3.106) 


indicate SU(3) breaking at the 20% level, but the left- and right-hand sides of (3.105) are 
1129 and 1135, respectively, equal to better than 1%. Similar formulae apply to other low- 
lying hadronic states, such as the pseudoscalar (J? = 07) mass-squares 2, Wj, u2 and the 
lowest vector (J? = 17) meson mass-squares, p42, W3¢+, H3. (One must include the effects 
of octet-singlet mixing between the 7 and 7 and between the ¢ and w. See Section 5.8.3.) 

The situation is simpler for the J? = ie states, which transform as a 10 (decuplet). 
There is only one invariant of the form 10 x 8 x 10*, so there is only one reduced matrix 
element in (10|£g|10), leading to a linear spacing in the masses as one goes to lower Y. This 
works very well, as can be seen in Figure 3.2. In fact, the Y = —2, S = —3 baryon Q7 was 
not known at the time SU(3) was proposed. The prediction of its existence and mass from 
the GMO ansatz was a great triumph for SU(3). 


p 0 + ++ 
Ay A iy gå aX 1230 
D Ti Lt 1385 
+ A d + 7 @ e © 13 
=" =L = 2 1 2 
Dg= T0 5 
z 4 ee 1530 
Oye 1672 (predicted) 
Figure 3.2 Weight diagram for the J? = i decuplet. 


When SU (3) was proposed it was generally believed that isospin was an exact symmetry 
of the strong interactions, and that the observed small (1%) breaking is due to electromag- 
netic and weak interactions. We now understand that there is a small breaking component 
intrinsic to the strong interactions as well (in part because estimates of the electromagnetic 
contribution to the proton-neutron mass difference predict a heavier proton). This can be 
incorporated by writing 

L = Lo + €3L3 + els, (3.107) 


where e3 = O(1%) and eg = O(25%). (A simple model for L3 was given in (3.86), but we do 
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not assume that specific form here.) Additional predictive power comes from the assumption 
that £3 and £s are both operators from the same octet. SU(3) is very successful, and is 
extremely useful for relating various strong and electroweak matrix elements. It is even more 
successful when extended to chiral symmetry. The existence of heavier hadrons associated 
with the charm (c) and heavier quarks suggests the possibility of considering SU(4) or 
higher. However, SU(4) is so badly broken that such an extension is not very useful. 


The Quark Model and SU (3) 


The above description of SU(3) in terms of baryons and mesons becomes simpler when 
reexpressed in terms of the three lightest quarks and their fields, u,d, and s, which trans- 
form under isospin! as a doublet (u,d) and singlet (s). Under SU(3) they transform as a 
fundamental (triplet) while the antiquarks (and the q fields) transform as 3* 


u qı BX _ ox 
q=| d|=]| g | 7&7 4, q= (ŭd 5) > Ge"? , (3.108) 
S q3 
which is equivalent to 
[Mau s Heak ql. (3.109) 
The quark electric charges and strong hypercharges are given by 
2 1 
ao ae a 1 
253 2/3 3 1 ” v3 3 9 ( ) 


The weight diagrams are shown in Figure 3.3. 
A quark and antiquark can combine to form an octet and singlet, i.e., 3 x 3* = 8 + 1. In 
an obvious matrix notation, 


uu ud us y 
qxG=| di dd dij |= M + I (3.111) 
= T a ~ 
sū sd s5 octet v3, 
singlet 
2uu—dd—ss g ud us i i 10 0 
= du laos ds + 3 (uu + dd + s5) 0 10 4], 
sū sd 2s5—uū—dd 0 0 1 


where M is the pseudoscalar octet (3.94) expressed in terms of quarks, and 7! = (uū + dd + 
s5)/1/3 is the singlet. 

The J? = ar baryons also transform as an octet, but are constructed from three quarks 
qqq: 


3x3=3*+6 


: (3.112) 
3x3x3=3x3*+3x6=1+8+8+10. 


15In addition to isospin, SU (3) has two other (non-commuting) SU (2) subgroups, U-spin and V-spin. 
In quark language, these are associated with the d + s and u + s transitions, respectively. These are 
more badly broken than isospin, but are occasionally useful for specific applications, e.g., Problem 3.21 
and (Commins and Bucksbaum, 1983). 
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es 
Ll La 
2 2 
ed eY 
t mee H — h 
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el ed 
Jal Jad 
2 2 
@s 
Figure 3.3 Weight diagrams for the 3 and 3* of SU(3). 
Thus, both the octet baryons and the J? = oe decuplet can be constructed from three 


quarks. Of course, one must also include space, spin,'® and color indices in the construction 
(see Problem 3.22); in fact, one of the reasons for the prediction of the color quantum 
number is that otherwise the Q7 would be a totally symmetric (J = 3/2) composite of 
three s quarks, in violation of the spin and statistics theorem. 

The free quark Lagrangian density is 


a=u,d,s 


= Ga (i P — Mo) da — Ms (tu — dd) — mg (ūu + dd — 25s) /V3 


= (iP — mo) Iq — msagr*q — msqr°q. (3.113) 
mce——“/—_——_“—_ “SS x 
Lo —e3ls —egleg 


Lquark Would be invariant for degenerate masses, but the mg, terms automatically transform 
as components of an octet, so the GMO ansatz for SU(3) breaking is automatically realized 
in the quark model. The u,d, and s masses are related by 


m m 
My = Mmo + Mms + —, ma = mo —m3 + È, Ms = Mo — 2ms/V3. (3.114) 


v3 v3 


They will be further discussed following the extension of SU (3) to a chiral symmetry. 


SU(m) Tensor Notation 


There is a powerful tensor notation for SU(m) (de Swart, 1963), which generalizes the 
matrix notation introduced for the adjoint in (3.31). Denote the components of a field w 
transforming as a fundamental m with a lower index, pa, and those of an antifundamental 


16For the non-relativistic quark model it is sometimes useful to invoke a mixed SU(6) spin-flavor sym- 
metry, which is quite successful in describing the baryon spectrum and properties such as magnetic mo- 
ments (e.g., Hey and Kelly, 1983; Georgi, 1999; Capstick and Roberts, 2000; Close et al., 2007; Donoghue 
et al., 2014; Patrignani, 2016). 
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( m*) field x by an upper index, y“, 
[T* pa] = -Li p, = [T*x*] = +x? L5", (3.115) 


where Li? = (Lijao = —(Lt,«)oa. Of course, the conjugate yt is antifundamental, 
(Pa)? = (Wt). Higher-dimensional representations can be formed as products of funda- 
mentals, such as Yab, which can be regarded either as an elementary field or as a product of 
two fundamentals W142». IRREPs can be formed by symmetrizing and antisymmetrizing, 
e.g., 


1 1 
Pab = = (Pab — Poa) +5 (Vab + Yoa) - (3.116) 
we 28 2 
mxm m(m—1)/2 m(m-+1)/2 


Mixed tensors, such as 7°, can again be either an elementary field or a product ax?. The 
nè can be decomposed into an adjoint and singlet, 


i ‘ 6° 6° 

Na = Na T mle + ale (3.117) 
mxm* y b 
m2—1 1 


The transformation of a general mixed tensor is 
Oak 


[T Ptn b ] = Lis pier... — [ie phie + perh Libi... q perve pih, (8.118) 


It is straightforward to prove that the antisymmetric product of m — 1 fundamentals is an 
m*, ie., 


Eom btm =H, (3.119) 
wn Snr 
m™-—1 m* 


where %1 Om = €ai--ap is the totally antisymmetric tensor in m indices. With this ma- 
chinery, it is simple to construct the SU(m) invariant operators by contracting upper and 
lower indices of the operators and the € tensor. For example, 


Pan, Want, moram Carran X T, (3.120) 


are all invariant (Problem 3.24). The first two operators are also invariant under the ex- 
tension to U(m) = SU(m) x U(1), with Ya > eda, X? > x'e, and Phin 
eilk—0e Płi:::b , The contractions involving the e tensor are not U(1) invariant. Using tensor 
notation, the 7N Lagrangian density in (3.84) becomes 


2 


Lo = 4° (i ð — m) Ya + iV 29r Y T + ; [or — pn? —x (x?) f (3.121) 


b 


= Mas and n? = mÈ Th 


3.2.4 Chiral Symmetries 


We saw in Section 2.7.2 that one can view the left (L) and right (R) chiral projections 
WL, R = Pr.rw of a fermion field w as independent degrees of freedom, with Y = wr + wr. 
The significance of the chiral projections is that they can have different transformation 
properties under a global or local symmetry group G. Suppose 


[T var] = -Liapor [T*,var] = -Lator (3.122) 
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If Li, # Li the transformation is chiral; otherwise it is non-chiral. For example, the weak 
interactions are associated with a chiral gauge symmetry, implying parity violation. The 
strong interactions obey a non-chiral gauge symmetry, but have approximate chiral global 
symmetries. Even for a chiral symmetry, the fermion representation matrices may be re- 
ducible, with some of the fermions chiral (i.e., their L and R components transform dif- 
ferently), and others non-chiral or vector (i.e., they transform the same way). The fermion 
Noether current for a chiral symmetry is 


Ji, = Par Yp (Li) ap VOL + Varn (Le) op Vor (3.123) 


which reduces to (3.79) for the non-chiral case Li, = Lh = L’. In matrix notation with 


b= (ree tn)?, 
a i^- , , , , 
Ji = Pou [LPs + LigPr] Y = 50% [(L} + Lh) - (L4 - LR) (8124) 


where Li Pr +L PR contains both Dirac and internal indices. Under a finite transformation, 


r> Prp, pr ePEnpr, qp eP PPHP], (3.125) 
As an example, consider the free fermion Lagrangian density 


L= Wai Da = VaMarr 


sh K - n (3.126) 
= Wart par + Vari Yar — VaLMarWoR — PYaRMab oL, 


for a,b = 1---n. m is an n X n mass matrix which we assume to be Hermitian (one 
can generalize to a non-Hermitian mass matrix, or, equivalently, one involving Pr, pr, as in 
Problem 3.32). The kinetic part of £ is invariant under the chiral flavor symmetry U (n)z x 
U(n)r = SU(n)_ x SU(n)r x U(1)y x U(1)a. The subscripts L and R have no group 
theoretical significance. Rather, they indicate that the L-chiral fermions transform as an n 
under SU (n)z, but as a singlet under SU (n)pr, and the reverse for the R-chiral fermions, 


SU(n)L: Wat > (er) WoL, WaR > VaR 
ab —_—— 
-——“_ 1 
ý -3 (3.127) 
SU(n)r: at > PaL, VaR > CaaS Wor; 
< ab 
1 -—~ 


n 


where the r and r are independent parameters. The non-chiral (vector) U(1)y generator 

is the sum of the U(1)z and U(1)pr ones. It corresponds to fermion number, i.e., YaL,R > 

ePpaL R. The avial U(1)4 generator is the difference of the U(1)z,r ones. In QCD the 

U(1)a generator is broken by non-perturbative effects, as will be discussed in Chapter 5. 
The mass term Yrmyr + h.c. is only invariant for 


e~t r Ln metr En — m, (3.128) 
First consider the case that m is diagonal with degenerate entries, m = m; T: 


(a) For m, #0 the mass term breaks the chiral symmetry to the subgroup with B p= B. R- 
This corresponds to the non-chiral SU (n) with Li, = Lh = Li. 
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(b) For m = 0 the full SU(n); x SU (n)r x U(1)y chiral symmetry is preserved, i.e., chiral 
symmetries involve vanishing fermion masses. 


Of course, one can have hybrid situations, e.g., in which m is diagonal with blocks of 
degenerate eigenvalues. Each block with k equal but nonzero masses will correspond to a 
non-chiral SU (k) x U(1)y subgroup of the original symmetry, while a block with / massless 
fermions will have a chiral SU (l)z x SU(l)r x U(1)y symmetry. 

It turns out that the quark masses are m, < mq ~ a few MeV, which is much less 
than typical hadronic mass scales. Thus, the strong interactions have an approximate 
SU(2)r x SU(2)p chiral flavor symmetry, explicity broken by my,q. This is usefully ex- 
tended to SU(3), x SU(3)pr, though the latter is broken by the much larger ms = O(100) 
MeV. The precise meaning of these current (Lagrangian) quark masses, and of the con- 
stituent quark masses ~ Mp,n/3 ~ 300 MeV, will be discussed in Chapter 5. 


3.2.5 Discrete Symmetries 


In addition to the continuous symmetries, many theories have discrete symmetries charac- 
terized by a group with discrete elements. As an example, consider a single Hermitian field 
og, with 


_1 2_1 2,2 BP AGE 
L= 5 (0.9) 5H +a 3 "E (3.129) 


For general couplings £ has no internal symmetries. However, for the special case a = k = 0, 
L£ in invariant under the discrete two-element group Z2 = {I, R}, where ¢ > —¢ under R. 
The Z symmetry has the obvious consequence that the number of particles is conserved 
modulo 2 in any reaction. 

As another example, the Lagrangian density for two Hermitian fields 


Aud, Azo Ag? 3 


aa 4 4 2 


5 (Oubr)? + 5 (ude)? -A (3.130) 
is O(2) invariant for Ay, = Aw = A12. For Ay. = A22 Æ A12 the symmetry is broken to a 
discrete subgroup consisting of rotations by {0,7/2,7,37/2} and of reflections times the 
same rotations, i.e., to the interchange between ¢; and ə as well as possible sign changes 
for one or both fields. For 411 Æ Azz only the sign changes survive. Discrete symmetries are 
not associated with conserved charges and often (as in the Z and the A11 Æ A22 examples 
above) do not imply that particles fall into degenerate multiplets. Their major consequence 
is usually to relate couplings or to remove certain couplings that would otherwise be possible, 
and they are often introduced ad hoc in model building for that purpose. They may also 
come about as a low energy remnant of a continuous symmetry in an underlying theory 
that is valid at high energies. 

A more realistic example is G-parity, which is an approximate discrete symmetry of the 
strong interactions: 


G= Cert, (3.131) 


where C is the charge conjugation operator and T? is the second generator of the SU(2) 
isospin group. The mesons consisting of u and d quarks and antiquarks are eigenstates of 
G-parity. G-invariance, which follows from QCD if isospin breaking is neglected, leads to 
important constraints on their interactions. For example, the pions have G = —1 while the 
n has G = +1. G-parity therefore forbids the decays 7 — 37 as well as any transitions 
between an odd number of pions by the isospin-conserving strong interactions. (7 > 37 
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does proceed much more slowly through isospin breaking effects, especially the u — d quark 
mass difference.) Consequences for nucleon matrix elements are considered in Appendix G. 

The G parity assignments can be justified experimentally. For the pions, C7°C~! = n? 
since 7° — 2y is observed, and Cr+C~! = +n7 in our phase conventions. Similarly, the 
observed 7 + 2y decay implies CnC! = +n. The G-parity then follows because the 7 and 
7 are, respectively, an isotriplet and isosinglet. The assignments also follow from the quark 
model. It is shown in Problem 3.27 that any color-singlet meson |qi G2) with q1,2 = u or d has 
G = (—1)/+S+!, where L, S, and J are, respectively, the total orbital angular momentum, 
spin, and isospin. The m and 7 have L = S = 0 so Gy = —G, = 1. Similarly, the vector 
mesons p° and w (Section 5.8.3) are mainly glut + dd), respectively, in an L = 0,5 = 1 
state, so Gp = —G,, = 1 and the decays p > 27 and w — 3r are allowed. 


3.3 SYMMETRY BREAKING AND REALIZATION 


Symmetries of the equations of motion may be broken explicitly by small terms in the 
equations themselves, or spontaneously in the solutions. Simple quantum-mechanical analogs 
of many of the possibilities, including no breaking, explicit breaking, spontaneous breaking, 
and combined explicit-spontaneous breaking for discrete and continuous symmetries, are 
described in Appendix H. More extensive discussions than the one here can be found in, 
e.g., (Lee, 1972; Pagels, 1975; Coleman, 1985). 


3.3.1 A Single Hermitian Scalar 
Consider the Lagrangian density 
1 
£L=5(,) -V (4) (3.132) 
for a single Hermitian scalar, where the potential is 
_ pd? Agt 


e (3.133) 


V (4) 


£ has no continuous internal symmetries, but does have a discrete Z2 symmetry under 
$ — —ọ (this would not be the case for the more general potential in (3.129)). The equation 
of motion for ¢ is 


(ga - 97) 6=-Fo =- Mta] (3.134) 


First consider the solutions to (3.134) for a classical field dciass- The lowest energy classical 
solution can be interpreted as the vacuum expectation value (VEV) or the ground state value 
of ¢, (0|¢|0) = (¢). This can be thought of as a coherent state or Bose condensation effect 
(cf., classical solutions to Maxwell’s equations). From the expression for the Hamiltonian 
density in (A.8), the lowest energy solution is for ¢(t, Z) = constant,‘” with a value () that 


minimizes the potential 


av ov 
Y| ee © >o, (3.135) 
09 lig OP? | (4) 


17 Any x dependence of the ground state would also violate translation invariance, while any VEV for 
higher-spin fields would violate Lorentz invariance. However, as we will see in the next section, higher energy 
classical solutions are merely excitations on the ground state and can involve x dependence or higher spin 
bosons. 
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where the first condition guarantees an extremum, and the second that the solution is stable 
(a minimum). One must choose A > 0 so that V is bounded from below. However, the sign 
of u? is arbitrary. The shape of V(¢) is shown in Figure 3.4 for both signs. u? > 0 is the 


, Vid) ; 


p(x) 


Figure 3.4 Left: potential in (3.133) for u? > 0 (dashed), u? < 0 (solid), or u? < 0 
with an explicit symmetry breaking term —a¢ (dotted). Right: domain wall solution 
of the classical field equation for ¢. 


familiar case. From (3.134) or Figure 3.4 the minimum occurs for (¢) = 0, i.e., the vacuum 
is just “empty space.” One can then quantize ¢ as in Chapter 2, u is the mass of the scalar 
particle, and the Z2 symmetry is unbroken. 

For u? < 0 there are three extrema, at ¢ = 0 and at ty = +,/—p?/X. The extremum 
ġ = 0 is a maximum; it is unstable, since for a small initial perturbation ¢ > 0 


d2 
a ~p’ > 0. (3.136) 


The other two are minima, as can be seen by rewriting 


À 2 Avt 
V(¢) = = (¢? - r’) (3.137) 
4 4 
Thus, there are two possible degenerate ground states, with (6) = +v. One can quan- 


tize around whichever of them is (randomly) chosen, and the Z2 symmetry (¢ > —¢) is 
spontaneously broken. For the + solution, one can write 


og=v+¢, (3.138) 


where ¢’ is a normal quantum field with (¢’) = 0. With this substitution, 


1 
LH) =L +6) = 5 O — V8) (3.139) 
with n 
1 ZH ee 24/2 13 Ard 
V= EN tre + Fb. (3.140) 


The first term is a constant, which is irrelevant until gravity is considered (then it becomes 
a contribution to the cosmological constant). The second term shows that the ¢’ field 
corresponds to a particle with (zeroth-order) mass-squared p3, = —2u? > 0. The third is 
a cubic self-interaction for ¢’ induced by the spontaneous symmetry breaking. It is due to 
the quartic ¢ interaction, with one of the legs replaced by (¢) = v, as seen in Figure 3.5. 
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Figure 3.5 Quartic and induced cubic self-interactions for ¢’. 


The induced cubic interaction is a manifestation that the discrete Z2 symmetry of £ is 
spontaneously broken by the ground state (classical) solution, and has the effect that particle 
number modulo 2 is no longer conserved. The last term is a quartic self-interaction that is 
unaffected by the symmetry breaking. 

For the intermediate case u? = 0 it is not sufficient to consider the theory classically. 
One must extend the discussion to consider the effective potential, which reduces to the 
potential at tree-level in an expansion in the number of loops. The model considered here is 
difficult to study; the apparent minimum is at nonzero v but occurs outside of the range of 
validity of the expansion. However, in more realistic theories, such as the standard model, 
the symmetry is spontaneously broken for u? = 0 (Coleman and Weinberg, 1973). The 
effective potential is extremely useful for incorporating higher-order effects in the study of 
symmetry breaking and for considering field theory at finite temperature (see, e.g., Dolan 
and Jackiw, 1974; Weinberg, 1995; Quiros, 1999). 

One can perturb the potential in (3.133) by linear or cubic terms that explicitly break 
the Z symmetry, as in (3.129). For definiteness, we will consider the linear tadpole operator 
—ag¢ with a > 0, 

t 

V ($) = 1" 
An immediate consequence is that the Z2 symmetry must be violated in the ground state 
as well, i.e., (6) 4 0. For u? > 0 the VEV is induced by the explicit breaking, v = (¢) = 
a/u? + O(a3). The most important consequence is the presence of a small cubic term for 
the shifted field & = ¢— v, 


ee (3.141) 


ye À 
VP) = Fer wg? + THE + Ola’), (3.142) 


similar to the (larger) one in (3.140). For u? < 0 the potential is shifted, as seen in the 
dotted curve in Figure 3.4. The deepest (global or true) minimum is at 
a 


v=n+ za t O(a’), (3.143) 
0 


where vo = \/—p?/X is the unperturbed minimum. For small enough a there is another 
metastable local or false minimum for v < 0. Such metastable local minima are frequently 
encountered in field theories, and in some cases it makes sense to quantize around them 
rather than around the global minimum (e.g., Kusenko et al., 1996; Intriligator et al., 
2006; Degrassi et al., 2012; Camargo-Molina et al., 2014). The relevant issues for a realistic 
theory with a metastable vacuum are: (a) Is the lifetime of the metastable vacuum due 
to tunneling (Linde, 1983) long compared to the 101° year age of the observed Universe? 
(b) Which vacuum would have been occupied initially as the Universe cooled? 
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3.3.2 A Digression on Topological Defects 


There are also more energetic classical solutions to (3.134). For example, the static domain 
wall solution is an infinite wall perpendicular, e.g., to the x direction, with $(x) varying 
from ¢(—co) = —v to $(+00) = +v. This is illustrated in Figure 3.4, where the center of 
the wall is at x = 0 and the wall is parallel to the y and z directions. Energy is stored in 
the wall in the transition region near x = 0. The thickness d of the transition region can 
be estimated by minimizing the sum of the kinetic energy density ~ (v/d)? = |u|? /Ad? and 
the potential energy density (with respect to the minimum) ~ ut/A, leading to d ~ 1/|p). 
Thus, the energy density per unit area is ~ du*/X ~ |y|>/X. Since the wall is infinite in 
extent, it would take infinite energy to tunnel to one of the ground states ¢(x) = +v, so 
the wall is stable. (Such objects are known as topological defects.) 

This simple model illustrates a generic difficulty with spontaneously broken discrete 
symmetries. Such walls would presumably have formed in the early Universe as it cooled 
from a temperature T much larger than |u| because causally disconnected regions would have 
fallen randomly into either of the two minima, somewhat like the formation of ferromagnetic 
domains. Both walls and anti-walls (making transitions from +v to —v) would have formed. 
Most would presumably have been annihilated, but one would expect at least one to survive 
in a volume V ~ R? of the size of our observable Universe, contributing to the energy density 
and anisotropy of our Universe. To get an idea of the magnitude, let us assume the average 
energy per unit volume due to a single domain wall is bounded by the observed average 
energy density (tot, 


energy lel? 


3 4 
alma aR (3 x 107%eV) . (3.144) 


Of course, this underestimates the constraint since the observed energy density is ex- 
tremely isotropic, unlike a domain wall. Using R ~ 1.4 x 10!° yr ~ 4 x 10!" sec, this yields 
|u| /A1/3 < 30 MeV. Discrete symmetries spontaneously broken at a larger scale are therefore 
cosmologically dangerous. !8 

Spontaneous symmetry breaking in particle physics can lead to other defects of possible 
cosmological relevance, which may involve classical values for gauge fields as well as scalars. 
These include monopoles, which occur when non-abelian symmetries are broken down to 
a subgroup containing a U(1) factor, and cosmic strings, associated with U(1) symmetries 
(see Problem 4.1). In another class are textures, which are not topologically stable but may 
be long-lived. These matters are discussed in much more detail in (Coleman, 1985; Vilenkin, 
1985; Kolb and Turner, 1990; Brandenberger, 2013). 


3.3.3 A Complex Scalar: Explicit and Spontaneous Symmetry Breaking 


Consider a complex scalar ¢ with 


Lo = (ud)! P-V (H), V(b) = wtb + A(ot4)’, (3.145) 


where à > 0 so that V is bounded from below. As discussed in Section 2.4.1, Lo is invariant 
under the U(1) phase transformations ¢ + et’ ġ, with a conserved charge corresponding to 
particle number. It is convenient to go to a Hermitian basis, ¢ = (¢1 +i¢2)/V2, as in (2.91), 


18They can be avoided if the reheating temperature of the Universe after a period of inflation is smaller 
than u. Also, the addition of a small explicit symmetry breaking term —K¢? or —ad to (3.133) would 
eliminate the problem; the energy difference would lead to an attractive force between domain walls and 
anti-walls and therefore more rapid annihilation. 
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so that 


1 
Lo == 


Š À 
5 (0u + (Aub2)?] -V (1,40), V = ++ (67 + 98)". (8.146) 


As shown in (3.75), the U(1) transformation takes the form of a rotation’? in this basis 
Qı cos8 —sin O 
( AET E: A (3.147) 


(In fact, U(1) and SO(2) are equivalent.) Again, the vacuum corresponds to a classical 
solution of the field equations 


Ə > 
(aP) de =— [PHAGE + A 6 Cs 
with minimum energy. This implies a constant Va = (Qa), with 
OV 
= 0. 3.149 
Balan ene 


The minimum condition requires that the eigenvalues m? 5 of 


OV əv 
3p? 313p 
OV əv 


0¢10¢2 0¢3 


(3.150) 


1,2 


must be non-negative. (These are interpreted as the mass-squares of the physical mass 
eigenstate particles when one expands around the minimum.) 

For u? > 0, the minimum is at vı = vz = 0, as seen in Figure 3.6. One can quantize 
around this point, obtaining degenerate $1,2 with mass-square u? and the relations between 
the quartic couplings unbroken (or, equivalently, degenerate @ and ¢!, with a conserved 
particle number). Thus, there is an unbroken symmetry in both the equations of motion 
and the ground state. This is known as the Wigner- Weyl realization of the symmetry. 

There are two ways in which a Lagrangian symmetry can be broken. One is to add small 
explicit breaking terms, as we did for SU(3) in (3.113). For example, with u? > 0 and 


f= fy- 503 (3.151) 


the U(1) ~ SO(2) symmetry would be broken, with nondegenerate masses 


m =p, mi =u +e, (3.152) 
corresponding to the Hermitian mass eigenstates ¢, and ¢2. In this example, the quartic 
relations are not modified at tree-level, though there would be finite corrections induced at 
loop-level.?° However, there would no longer be a conserved particle number (Problem 3.29). 

The other possibility is spontaneous symmetry breaking (SSB), also known as the Nambu- 
Goldstone realization of the symmetry (Nambu, 1960; Nambu and Jona-Lasinio, 1961; Gold- 
stone, 1961). For u? < 0 the Mexican hat (or wine bottle) potential in (3.146) has its minima 


19The full symmetry of Lo is O(2). The extra ¢2 > —¢2 reflection symmetry corresponds to ¢ © ġÝ in 
the complex basis. 

20They are finite because the symmetry breaking is soft, i.e., the coefficient has a positive power in mass 
units. 
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Figure 3.6 Left: the potential in (3.146) for u? > 0. Right: potential for u? < 0. 


away from the origin, as seen in Figure 3.6. The rotational symmetry leads to a circle of 


degenerate minima at 
2 


Ptr = >0, (3.153) 
as illustrated in Figure 3.7. The true ground state must pick a specific point on this circle, 
spontaneously breaking the rotational symmetry in much the same way that the spins in a 
ferromagnetic domain line up in a specific direction. Since the classical field carries a U (1) 


charge, the symmetry is broken and the charge is not conserved. 
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Figure 3.7 Top view of V(¢) for u? < 0. The origin is a local maximum (unstable). 
The points on the dashed circle with radius v are degenerate minima. 


Another consequence of the symmetry breaking is the existence of a massless Nambu- 
Goldstone boson. The Nambu-Goldstone theorem states that for every spontaneously broken 
generator of a continuous global symmetry there exists a massless spin-0 particle, the Gold- 
stone boson. 

Let us demonstrate these statements in this example. Without loss of generality, one can 
choose axes in the ¢; 2 plane so that the ground state is at vı = v and vz = 0. (Equivalently, 
for any choice of ground state one can make an SO(2) rotation so that this holds for the 


rotated fields.) Then, define 
di=V+o, G2 = $z, (3.154) 
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where (¢/) = 0. One can quantize ¢/ in the usual way. In terms of ¢/, 


1 1 
ka (app? + 5 (ap) — V (Hi, 64) 


4 
v= ACETRE CETE 


(3.155) 


Similar to the Hermitian field example, the first term is a constant, the second implies that 
ġi has mass-squared m? = —2u? > 0, the third is an induced cubic interaction which implies 
there is no conserved charge, and the last is the quartic interaction. There is no mass term 
for $4, i.e., it is a massless Goldstone boson. The interpretation is that the potential is flat 
as one moves from the minimum in the ¢ direction, so that excitations from the ground 
state are massless “rolling” modes rather than massive oscillations. 

It is possible to combine explicit and spontaneous symmetry breaking. Suppose there is 
a small explicit breaking term as in (3.151), with u? < 0 and 0 < e€ < |y|?. This tilts the 
potential in Figure 3.6 so that there is a minimum at (3.154) (which is unique except for 
¢, > —v), with 

m? = —2y", m =e Km. (3.156) 

Thus, the Goldstone boson acquires a small mass from the explicit breaking (it is known as 
a pseudo-Goldstone boson). 


3.3.4 Spontaneously Broken Chiral Symmetry 


Let us extend the discussion of spontaneous symmetry breaking to a chiral symmetry. 
Consider a single chiral fermion Y = wr + Yr, and a complex scalar ¢, with 


L= bride. + bri Pyr —hbrvrad—h* debe + (0,0)! p- V (Q) 
with V(¢) =p26'6+A(¢t¢)’, (3.157) 


where À > 0. Without loss of generality we can take the Yukawa coupling h to be real 
and positive (if necessary by absorbing any phase into wr, > etrs h), which is not a 
symmetry). £ has a chiral U(1) symmetry?! 


pọ — eg, YL > YL, bre yp, (3.158) 


which forbids elementary fermion mass terms. For u? < 0 the symmetry is spontaneously 
broken. The scalar part of £ can be rewritten in terms of the shifted fields defined in (3.154), 
just as in (3.155). The Yukawa terms become 


hv - i h .- 
Era =~ abo (14 E) - eid, (3.159) 


where we have used Wipe + YrYL = vy and wpe — YrYL = Wy°v. Thus, ~ acquires an 
effective mass h 
V 


Thur = 
A 


from the spontaneous breaking of the chiral symmetry. The Hermitian scalars %1 couple 


(3.160) 


as a scalar and pseudoscalar, respectively, to 7, with a Yukawa coupling h/V/2 = My/V. 


k 21 The individual chiral transformations of the %z and wR are somewhat arbitrary. All that matters is that 
bivR > etf rL Yr. Changing the prescription is equivalent to adding terms proportional to the unbroken 
fermion number generator to the chiral charges. 
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3.3.5 Field Redefinition 


In field redefinition one introduces new fields as functions of the original ones. We saw simple 
examples in Section 2.10 involving the change in the overall phases of complex scalar or 
fermion fields. In some cases, such phase rotations correspond to U(1) symmetries. However, 
they are also useful in the absence of a symmetry for removing or changing the phases of the 
constants appearing in £. Field redefinitions were also encountered in Section 2.14 where 
they were used to put kinetic energy and mass terms in canonical form. Yet another example 
was seen in the model of a spontaneously broken chiral U(1) symmetry described in Section 
3.3.4, which can be described in terms of the shifted fields ¢{ p with interactions given in 
(3.155) and (3.159). 

It is sometimes convenient to rewrite the theory in terms of new fields that are more 
complicated functions of the original ones. Such field redefinitions can lead to theories that 
look very different from the original ones, though for a wide class of cases they are equivalent 
in the sense that they have the same on-shell S$ (transition) matrices (Haag, 1958; Coleman 
et al., 1969; Callan et al., 1969). In the case of SSB, instead of the rectangular Hermitian 
basis in (3.146), one can define the fields in a polar basis, i.e., 


j= ae eil (3.161) 


where 7 and € are Hermitian fields. 7 is invariant under a U (1) transformation, while € is 
shifted 


n>n oan (3.162) 
The ¢ part of £L becomes 
1 1 n]? 
Lo = 30m +308 [1+ 2] -vo 
A i (3.163) 
Vin) = I © Pn? + Avn? + ni 


i.e., € drops out of the potential and is therefore massless, illustrating that the Goldstone 
boson is just the phase of ¢. The self-interaction terms in V for 7 are the same as those 
for ¢ in (3.155), and m} = —2y?. Both n and € have canonical kinetic energy terms 
(the 1/v normalization of the phase in (3.161) was chosen for that purpose), but there 
are now derivative couplings of (0,,£)? to 7 and 7 with coupling strength 1/v, which enter 
through the ¢ kinetic energy terms. These replace the ¢) interactions in (3.155). The Yukawa 
interactions in (3.159) become 


hv - nN\ ; 
| ee cs (1+ 2) eE 4 he., 3.164 
Yuk Ja veer 7 ( ) 
i.e., there are interactions with € to all orders. However, this too can be simplified by a 
redefinition l 

Yr = eE yp. (3.165) 


€ disappears from Lyuk, but reemerges as a derivative coupling to the right-chiral fermion 
that enters from Yri Yr. Altogether, L = Lf + Le, where 
= OnE u hv - n 
= — — —= 1 1) i .1 
Ls = pi dh + —-drde aie ( i (3.166) 
The prime on Yr has been dropped. It is straightforward to show that the derivative cou- 
plings of 0,,€/v in (3.163) and (3.166) are to the Noether current associated with the U (1). 
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The reformulation of the model in the polar basis is not manifestly renormalizable, due 
to the derivative couplings and/or the higher-order couplings in (3.164), and for consider- 
ing high energy or higher-order calculations the original rectangular basis is more useful. 
However, the new variables define an effective theory that is very useful at low energies. The 
physical particle content and nature of the Goldstone boson are transparent. Also, all of 
the € interactions involve derivatives, which translate into factors of the € four-momentum 
pe in physical amplitudes. Therefore, such amplitudes must vanish for pe —> 0. This finds 
application in the soft pion theorems in the more realistic case of QCD. Even for pe 4 0, the 
formulation is useful for small external momenta compared with v, where the interactions 
are small. A similar field redefinition is useful for displaying the physical particle content 
in spontaneously broken gauge theories (the Higgs mechanism). Calculations in the various 
formulations are compared in Problem 3.30. 


3.3.6 The Nambu-Goldstone Theorem 


As we saw in an example in Section 3.3.3, there are two possible realizations of an exact 
continuous symmetry of the Lagrangian (the Goldstone alternative). In the Wigner-Weyl 
realization, the ground state also respects the symmetry. Particles fall into degenerate mul- 
tiplets with relations between their couplings, and for a chiral symmetry the chiral fermions 
are massless. In the Nambu-Goldstone realization, the symmetry is not respected by the 
vacuum, massless Nambu-Goldstone bosons appear, and chiral fermions acquire effective 
masses. The converse statement is Coleman’s theorem (Coleman, 1966), i.e., that a continu- 
ous symmetry of the vacuum is also a symmetry of the Lagrangian. The various possibilities 
for the realization and breaking of continuous symmetries are summarized in Table 3.5. 


TABLE 3.5 Possibilities for the realization and breaking of a continuous symmetry. 


Exact Lagrangian Symmetry ((Uc, £] = 0) 


Wigner- Weyl: Ug|0) = |0) Nambu-Goldstone: Ug|0) 4 |0) 

Exact symmetry Spontaneous symmetry breaking 
Degenerate multiplets Chiral: fermions acquire mass 
Conserved charges Global: Goldstone bosons 

Relations between interactions Gauge: gauge bosons acquire mass by 
Chiral: massless fermions Higgs or dynamical mechanism 


Gauge: massless gauge bosons 


Explicit Breaking ([Uc¢, £] 4 0) (global only) 


Multiplet splitting, etc. Multiplet splitting, etc. 
Chiral: fermions acquire mass Goldstone bosons acquire mass 


Let us extend the proof of the Nambu-Goldstone theorem at tree level to an arbitrary 
scalar sector, which can always be represented by n Hermitian scalars ¢,, a = 1---n. One 
can write the n fields as a column vector ¢ = (¢1 $2::-@n)", and their possible VEVs by a 
column vector v = (b) = (v1 v2- Vn)! . Some or all of the va may be zero. The Lagrangian 
density consists of the kinetic energy terms for ¢, a potential V(¢), and possible terms 
involving fermions. The most general renormalizable potential is the quartic polynomial 


V(d)=Vo~ Ja Qa + 5i2rba by + Katea Po be + Aatedba Po Pe ba (3.167) 
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The coefficients o,, jis. Kabe, and Aabca are real and symmetric in the indices. v is determined 
by minimizing the potential, i.e., (OV/0¢q)|,, = 0, as in (3.149). Define the shifted fields 
P = Pa — Va With (¢/,) = 0 and rewrite V in terms of ¢’, 


ae io i ` 
V(¢') =V + 5 hav by + Rabe by Pp Pe + Aadbed Py Pp Po Pas (3.168) 


a ( 8V ) 
Hab = | ap a 
IpaO po 
A 1 oV 
Aabcd = 
4! 06a9,0b-9bd 


There is no linear term in V(¢’) since we are at a minimum. The eigenvalues of the mass- 
squared matrix ĝ2, are the physical mass-squares, and the eigenvectors are in the directions 
of the mass eigenstate fields. The eigenvalues are guaranteed to be non-negative by the 
definition of a minimum. 

Now consider the role of a continuous internal symmetry, with generator representation 
matrices L’. Under an infinitesimal transformation (and returning to the original fields) 


where 


. _1f av 
yo Nabe = 31 \ 86,0600. 


= Aabed: 
v 


4 (3.169) 


V(¢) => V ($) + 6V(9), (3.170) 
where ay ay 
ôV (4) = ap,” =i (5. E) aoe (3.171) 


Invariance under the transformation requires 6V(¢) = 0, and therefore 


‘po = 0. (3.172) 


Differentiating (3.172) with respect to ġe and evaluating at ¢ = v, one has 
fiz, (Lv), =0, i=1---N. (3.173) 
Let us label the generators so that 
L'v=0, i=1---M, Lv#0, i=M+1---N. (3.174) 


The subgroup G” € G generated by T!---T™ leaves the vacuum invariant, T*|0) = 0, so G” 
is unbroken.?” However, symmetries associated with the remaining N — M generators are 
spontaneously broken, T"|0) 4 0, as can easily be seen from (3.29). (M = 0 [G completely 
broken], and M = N [no breaking] are special cases.) From (3.173) and (3.174) the scalar 
mass-square matrix has N — M linearly independent eigenvectors L'v with eigenvalue zero. 
Thus, there are N — M massless Goldstone bosons, one for each spontaneously broken 
generator. u2, also has p = n — (N — M) (generally) non-zero eigenvalues, corresponding to 
p (generally) massive scalar particles. 

The Nambu-Goldstone theorem holds quite generally.?° It can be proved to all orders 


?2The generators in (3.174) may be linear combinations of the original ones. The precise meaning is that 
there are M linearly independent generators for which Liv = 0 and N — M for which Liv Æ 0. 
23A way around this, for the axial U(1) generator, will be described in Section 5.8.3. 
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using the effective potential. It even holds non-perturbatively (Goldstone et al., 1962; Wein- 
berg, 1995), as will be shown in an example in Section 5.8. Goldstone bosons do not appear 
to exist in nature. However, pseudo-Goldstone bosons, which acquire a small mass from 
explicit breaking, are relevant to the chiral flavor symmetries of the strong interactions.” 
Furthermore, as will be seen in Chapter 4, for a gauge symmetry the Goldstone bosons are 
reinterpreted as the longitudinal modes of massive gauge bosons (the Higgs mechanism). 


3.3.7 Boundedness of the Potential 


To have a sensible theory there should be a lowest energy state, i.e., that the vacuum is 
stable, or at least a local (metastable) minimum. Absolute stability requires that the full ef- 
fective potential, including loop effects, running couplings, and possible non-renormalizable 
contributions, is bounded from below within the range of the validity of the theory. However, 
let us consider the simpler question of whether the renormalizable tree-level potential V in 
(3.167) is bounded from below for all values of the fields. V is well-behaved for finite ¢, so 
all we have to do is make sure that it is bounded from below as ¢ > oo for all orientations 
of the n-dimensional vector ¢. It suffices to consider the case ¢(a) = constant, and since 
we are concerned with the large ¢ behavior we can ignore spontaneous symmetry breaking. 
An arbitrary (constant) ¢ can be written?° 


Pa = Tea, (3.175) 


where r = (2a g2)! 2 is a radius vector and é is an n-dimensional unit vector with compo- 
nents €a. One requires 
lim V(r, ê) = c(ê), (3.176) 
Too 
where the limit c(é) is either a finite number or +00 for all orientations ê. If V —> —oo for 
any ê there is no well-defined ground state and the theory is unstable. 

In most cases the quartic terms in V determine the asymptotic behavior. If C(é) = 
Aabed Ca€becead > O for all unit vectors ê, then the theory is bounded, while if C(é) < 0 
for any ê the potential is unbounded. If C(é) = 0 for some directions ê, then one must 
investigate the other terms in V along those directions. 

It sometimes happens that there are some directions for which V(r, é) is independent of 
r, i.e., the quadratic, cubic, and quartic terms vanish so that V(r, ê) = Vo. This is especially 
common in supersymmetric theories (which have Vo = 0). If such flat directions correspond 
to the minimum of V the ground state is not uniquely defined at the renormalizable tree- 
level. In some cases, loop corrections to the effective potential, higher-dimensional (non- 
renormalizable) terms, or soft supersymmetry breaking terms lift the flatness to yield a 
unique minimum. 

As an example, consider the potential 


1 


1 
V ($1,92) = ZHI dt + 52 P2 + ro oa, (3.177) 


with Az > 0. One has 
o= ( or ) = rê, (3.178) 
2 


24Possible pseudo-Goldstone bosons associated with the breaking of a symmetry by anomalies (Sec- 
tion 4.5) are known as azions. Applications to the strong CP problem and to dark matter are mentioned 
in Chapter 10. 

25This form is also often useful for finding the minimum of V. 
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where r = (¢7 + ¢3)'/? and ê = (cos@ sin@)” is a two-dimensional unit vector. The quartic 
coefficient is thus C(é) = A2sin* 6, which is positive for all directions except sin@ = 0. 
In that case (¢, = +r, d2 = 0) the potential is bounded below, flat, or unbounded for 
u? > 0,0, or < 0, respectively. 


3.3.8 Example: Two Complex Scalars 


As a major example of some of these considerations, let 


_ Pi tip _ $3 tiga 
v2 v2 
be two complex scalars with Hermitian components ¢;, i = 1---4, and Lagrangian density 
L= |d,¢r)? + lupi? — V(r, brr) 
V = ufor? + eerlorr|? + Arloz|* + Aror + Arlero? (3.180) 
- Agrorr — A*$) Oh, 


Qr on (3.179) 


where, e.g., |o7|?7 = bier and |¢;|4 = (otor). For V to be bounded below it is sufficient 
to require that the quartic terms are positive for all values of 67, J = I,II, which holds 
provided 

ALII >0, Ari > ee ve (3.181) 


(The limiting case Ay = Azz = —Azrrr/2, which occurs in supersymmetry, is considered 
in Problem 3.35.) uł are real, and A is an arbitrary complex number. £ has an internal 
U(1) symmetry, ¢7 > eP oy with qr = —qrr = 1 (which is promoted to U(1) x U(1) 
for A = 0). W.l.o.g. one can redefine the phases of Øz, zy to make A real and non-negative. 
Then, if the parameters are such that the ¢; both acquire VEVs, the minimum will occur for 
(br) (rr) real and positive. We will assume that the individual VEVs are real and positive, 
i.e., 41,3 = (1,3) > 0 and v24 = (¢2,4) = 0. The U(1) symmetry implies that there are 
degenerate minima that differ from this by U(1) transformations. 
In the Hermitian basis with A real and non-negative, the last term in V is 


Va = —A(¢1¢3 — p201), (3.182) 


while the other terms depend only on ¢7 + 43 and $3 + ¢3. The nontrivial extremum 
conditions for v2.4 = 0 are 


OV 1 

=——| = pa + A? + =A vı — Av3 =0 

gı |, 2 

av 1 (3.183) 
Idz i = r + A3 + Zann? vz — Anı =Q. 


These have the trivial solution vı = v3 = 0, and possible nontrivial solutions with vı Æ 0, 
v3 # 0. The mass-square matrix for ¢1,3 (there is no mixing with ¢2,4 for either case) 
corresponding to the trivial solution is 


av av u A 
= 3g? 04610 = 
HRe = ( i oe a = ( a u ) . (3.184) 
d61063 Əş i 


11,3=0 
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This is a minimum provided the two eigenvalues ju? , are both non-negative. The trace and 
determinant are preserved by the unitary transformation which diagonalizes 43e, 


Motes = HI +UI Mabe = HIHI A’, (3.185) 


so the origin is a minimum provided both of these are positive, in which case there is no 
spontaneous symmetry breaking.?° Otherwise, the origin is a saddlepoint with one negative 
eigenvalue (for 747, — A? < 0), or a maximum with two negative eigenvalues (for w7+p7); < 
0 and p77, — A? > 0). In these cases, there will be SSB. 11,3 can be found numerically 
from (3.183), and the mass-square matrix for 1,3 can be written 
av av. 
Fo 3p? 3pı0p3 
HRe = aV aV 
0¢10¢3 ZH 


2v? + Avs /v AIV — A 
= ( ALIIIV1V3 = A 2rr1V3 + Av, /V3 ) i (3.186) 
11,3 
where u? rr have been eliminated using (3.183). 
More interesting is the mass-square matrix H? for 2,4. For 11,3 = 0, 7,,, is the same 
as (3.184) except A + —A. This has the same eigenvalues as 7, and the symmetry is 
unbroken. For the SSB case, one can again use (3.183) to obtain 


av xo vey 
a ( Ome te =a( Foa ). (3.187) 
11,340 V3 
VY 


062064 api 
This has a zero eigenvalue, corresponding to the Goldstone boson of the broken U (1). The 
unnormalized eigenvector is (vı —v3)". This is in the Lv direction, as expected from (3.174), 
since in the Hermitian basis, 


Vy 
=z 0 0 

L= ( 0 72 ) ; v= ia F (3.188) 
0 


where the 0 in L is the 2 x 2 zero matrix, in analogy with (3.74) or (3.147). The other 
eigenvalue is A(~3 + £1), with (unnormalized) eigenvector (vs 1)". For A —> 0 this state 
also becomes massless, i.e., both generators of the enhanced U(1) x U(1) are broken and 
there are two Goldstone bosons. 


An alternative derivation of these results utilizes a polar basis, analogous to (3.161), i.e., 
by = VIAN) geii, (3.189) 
V2 
where vr = 11, Vr = V3, and where 77 and Ey are Hermitian. The normalization of the 
exponents is chosen so that the kinetic energy terms 


Lre = ||," = ay (om + (0I)? i + | ) (3.190) 
J J VK 


are canonical for real fields, although there are new three- and four-point interactions in- 
volving derivatives that we will not be concerned with here. Under a U(1) transformation 
the £z are shifted while the 77 are unchanged, 


friar > S111 Evy rp. (3.191) 


26There are no other minima provided (3.181) is satisfied. 
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The £z only enter the V4 part of the potential 


Va = —A(nr + 41) (N11 + vrr) Cos (f+ u) j (3.192) 


which has minima for bt +5 S11 — 0. We will consider the minimum at €; = 0, with the other 
values related by U(1 i tr ansformations. The calculation of the VEVs and the mass matrix 
for the 77 proceeds as before. The mass matrix for the €; is obtained by expanding Va to 
second order, 


v, = Anı v3 (£ È s) 2 l4 (2 A) (usr + vir)? (3.193) 


T 
2 Vr vīr 2 v V3 v2 + v3 


This reproduces our previous result, with one Goldstone boson and one massive state. 
Approximating ¢7 ~ fetes tita), the eigenvectors and eigenvalues are the same as found 
previously. 


3.4 PROBLEMS 
3.1 Verify the Lie group multiplication rule in (3.6). 


3.2 Show that with the normalization (3.8) the structure constants c;;, are antisymmetric 
in all three indices. 


3.3 Prove the Jacobi identity 
[L*, [L , L*]] + [LŻ, [L*, L] + [L*, [L*, L7]] =0 


for an arbitrary representation and use it to prove that the adjoint representation matrices 
in (3.16) satisfy the commutation rules in (3.7). 


3.4 Use (3.10) to find an explicit expression for 7(@ B ) defined by the multiplication in 
(3.5) for the group SU(2). 


3.5 Prove the formal power series identity 


1 1 


e4Be-4 = B+[A, B] + zg [4 B] + 1A [4 [4 B]] ++, 


where A and B are any two operators or square matrices of the same dimension. 


3.6 Prove that det[e*4] = X (aA)! /k! = e*T4 for any Hermitian matrix A. This implies 
that the condition for a unitary or orthogonal matrix to be special (unit determinant) is 
that the generators be traceless. 


3.7 Prove the transformation law (3.32) for a field transforming according to the adjoint 
representation. 


3.8 Show that 1 
L= 5 |(Auba)” - Pe] -a (92)", 


with ġa = $1, is invariant under infinitesimal SO(m) transformations (i.e., [T*, £] = 0), 
with ġa, a = 1- -m transforming as the vector representation. 
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3.9 Derive the Noether currents in (3.54) and (3.79) for complex scalars and fermions, 
respectively. In each case, use the canonical commutation rules in Appendix A to show that 
the associated Noether charges satisfy the Lie algebra commutation rules. Assume there are 
no interaction terms involving field derivatives. 


3.10 In the free-field limit the isospin generators for the pion-nucleon fields in (3.82) can 
be written as 


=f es Ls) (E) aos rains (HE) a r) 


3 
i) ae ene aa 


where ba and da are, respectively, the annihilation operators for nucleon a = p or n and 
for its antiparticle, and a; is the annihilation operator for 7’. Show that the T? satisfy the 
SU (2) Lie algebra and that they have the right commutation rules with the fields. 


3.11 Show that the pion-nucleon Lagrangian density in (3.84) is SU(2) invariant by prov- 
ing that [T’, Lo] = 0. Use the representation matrices in (3.83). 


3.12 Show that the Condon-Shortley convention (analogous to that used for rotations) 


(L +iL7)\|\I'm) = /1$m)\T+m41)|Im+]) 


for the states in an isospin J multiplet with Ig eigenvalue m would require më = 
(mı F im) /V2 instead of (3.82). 


3.13 The pion and nucleon transform as isospin triplets and doublets, respectively. The 
approximate isospin invariance of the strong interactions implies that all of the amplitudes 
for 7;Nq — 7;N> can be expressed in terms of two amplitudes My, corresponding to total 
isospin 7 = 1/2 and 3/2. Write the amplitudes for +p > ntp, mp > 1p, mp > n?n, 
and 7°n > 7°n in terms of the Mz. (Note that at low energy the M3/2 amplitude, which is 
dominated by the A resonance, strongly dominates. Such calculations can be extended to 
SU(3) by means of isoscalar factors (de Swart, 1963; Patrignani, 2016).) 


3.14 It is sometimes useful to introduce a spurion, which is a fictitious or composite field 

with the right quantum numbers to parametrize the effects of symmetry breaking, and 
which can therefore be used as a bookkeeping device. For example, it is known that the 
effective operator for strangeness-changing nonleptonic weak transitions, which could in 
principle involve both AI = 4 and AT = 3 components, is strongly dominated by AI = Ł, 
Therefore, the transitions Ut > 7p, Et > mtn, and ET — mn should be related in 
the same way as the isospin-conserving amplitudes for tS — mp, +S —> ntn, and 
E-S + 2~n, where S is a spin-0 spurion carrying J = 1/2 and I3 = —1/2. Show that the 
amplitudes (using the Condon-Shortley convention for 7+ and £+) should be related by 


M(t > atn)— M(© > an) = —V2M (= > 7°), 


This relation, which holds separately for the S and P wave amplitudes (Problem 2.26), 
works reasonably well. (This simple example could be handled easily using the Wigner- 
Eckart theorem, but the spurion formalism is more convenient in more complicated cases.) 
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3.15 Generalize the pion self-interaction term in (3.84) to the case of four distinct types 
of pions, 74; = Ai, ngi = Bi, nci = Ci, tpi = Di, each transforming as an adjoint under 
SU (2). In general, there are cubic and quartic interactions such as 


Lr = —cijkl Ai Bj;Ck Dı = dijpAiBjCp. 


(There are also other possible terms, such as A?BC or ABD, which we will not consider.) 
However, £z is only SU(2) invariant for the combinations 


Ly = — ca Tr (AB) Tr (CD) — cgTr (AC) Tr (BD) 
— c,yTr (AD) Tr (BC) — d Tr (ABC), 


where A = A- 7/2, etc., so that Tr (AB) = A- B. (If A, B, and C are all pseudoscalars, 
then reflection invariance would require d = 0.) 

(a) Show explicitly that invariants like Tr (ABCD) are not independent. 

(b) The existence of only 3 quartic invariants can be justified as follows: A,B,C, and D 
are all isospin-1 operators. Thus, the product AB is a direct sum of operators Oe with 
isospin 4g = 0,1, or 2, and similarly for CD. However, the only overall singlets involve 
the pairing with [4g = Icp. Applying similar reasoning to the cubic term, 


2 
Lr=—)  c7OpObp — Op C. 
I=0 


Construct these invariants explicitly, and relate the coefficients cy and dı to Cg,g,y and d. 
(c) Show why there is only a single invariant in (3.84). 


3.16 The effective (non-renormalizable) interaction of a hypothetical doublet of mesons A 
with the nucleons is assumed to take the isospin-invariant form 


Lan = (oT) - (AFA), 


+ 
(R) (2) 


and 7’ are the Pauli matrices. Calculate the tree-level spin-averaged CM cross sections for 
Atp — Atp, A®°p > A®p, and Atn > A?p. Assume that mp = Mp and m4+ = myo. 


where 


3.17 By comparing the SU(2) and SU(3) expressions in (3.84) and (3.92), show that 
In = 5 (97 + Ja). 


3.18 Let A; and By, transform as SU(3) octets. Prove that F; = ifij,A;B, and 
Di = dijxAj;By also transform as octets. Note that the result for F; generalizes to any 
simple Lie algebra, while that for D; generalizes to SU(m). 


3.19 The interactions of pseudoscalar mesons with hyperons can be described by 
L = J9r+pn 1 DIP N+ gg+pa K PIYA + GKtno- Kni +--+ + h.c., (3.194) 


where the dots refer to other mesons and hyperons. Show that gg+ngx- can be expressed in 
terms of gr+pn and gx+pa in the SU (3) limit, and find the expression. 


3.20 Starting from the ansatz in (3.98), derive the Gell-Mann-Okubo mass relation (3.105) 
for baryons. 
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3.21 From (3.89), the electromagnetic current can be written as Jou = J} + van where 


J; and J} are elements of an SU(3) octet of currents. Jo, is a singlet ae the U-spin 
subgroup of SU(3), which has generators U? = — 4T’ + oe, U+ = T + iT". (a) Use 
U-spin to show that in the SU (3) limit the hyperon magnetic moments are related by 


1 3 
Hg+ = Hp, Hn = Heo = T3. + aha 
V3 
Hs- = Hz-, HEA = -7 (uso — pa), 


where uya is the transition moment observed in E° > Ay. 

(b) Use isospin to show that yo = 4 (“y+ + uy- ). 

(c) Use SU(3) to show that uso = —pyu4, allowing all of the magnetic moments to be 
expressed in terms of up and un. Hint: use the analog of (3.102). 


3.22 In the non-relativistic quark model the baryon octet is approximated as three-quark 
states that are totally symmetric in flavor, space, and spin but antisymmetric in color. The 
total and internal orbital angular momenta are zero. 

(a) Show that the flavor-spin part of the proton wave function is 


p+) = 


[|und) x (21 + +=) I+-+)-| ++)) 


+ {|123) -> |231)} + {(|123) > [312)}], 


1 
3/2 


where + denote spin projections of 4 +5, and the second and third terms are cyclic permu- 
tations on the three quarks. The neutron wave function is similar except u + d. 

(b) From (2.395) the proton magnetic moment is ff, = IphN Sip = (1 + Kp)UNGp, 
where Kp ~ 1.79 is the anomalous magnetic moment. Similarly, the neutron moment is 
Uw = JnliN Sn = KnUNOn, With kn ~ —1.91. These can be estimated in the quark model 
by assuming that the quarks are pointlike, i.e., that their magnetic moment operator is 
fi = Qir, Where Q has eigenvalues e, = 2 and eg = —ł, and M ~ m,/3 is the con- 
stituent quark mass. The nucleon moments i then be calculated by taking the expectation 
value of $ 19 3 fia, where ji, acts on the at” quark. Calculate the ratio gp/gn and compare 
with the experimental value. 

(c) Calculate the individual values of gp/2 and gn/2 and compare with experiment. 

See (Georgi, 1999) for a more detailed discussion of the magnetic moments of the baryon 
octet and the relation to SU(6). 


3.23 Draw the Feynman diagrams and write down the amplitudes for the processes yp —> 
n’p and yp > K*™° in the SU(3) limit, using the meson-baryon Lagrangian density in 
(3.97). Ignore strong interaction effects in the couplings of the photon. (It is not necessary 
to actually calculate |M|?.) 


3.24 Prove that the operator e° mPa, --a in (3.120) is invariant under SU(m) but not 
under U(m). 


3.25 Using (3.116), the fields 


vey = (Was — Va), Vy = Cad (Wad + Uva) [no sum] 


5 
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transform, respectively, as antisymmetric (A) and symmetric (S) products of two SU(m) 
fundamentals. Here, Cab = 1/ V2 for a Æ band Caa = L, The coefficients are chosen so that 
the corresponding states are properly normalized. 

(a) Calculate the representation matrices (Z4) ,,,,q (in terms of the fundamental L’ = Lin) 
and the Dynkin index T(L4). Show that T(L4) has the expected value for m = 2 and 3. 
(b) Calculate (L5) assed and T(Lg). Show that the L4 for m = 2 are of the correct form. 


Hint: be careful to avoid double counting in sums over the elements of y4’. 


3.26 Let day, a,b = 1---n be a complex scalar transforming as (n,n*) under SU (n)z x 
SU(n)r, i.e., l l 
[TE av] = — (Ln Jace, [TR; Pab] = +pac(Ln)cb, 


where T} and T$ are, respectively, the generators of SU (n)z and SU (n)pr. 
(a) Calculate a finite transformation of ¢. 
(b) Show that the Yukawa interaction 


Ly = htbardbartor + h*dor(bas) Yar = hdrdovr + h*yro YL 


and the mass term 

Lm = =u? (dav) bab = —p’Tr (¢'¢) 
are invariant under U(n); x U(n)p. In the second forms, ¢ represents an n x n matrix with 
components Pab, and ($")ya = (av)! 
(c) Construct the most general U (n)z x U(n) pr invariant renormalizable Lagrangian density 
for w and ¢. 
(d) Display the U(n)z x U(n)r Noether currents for the Lagrangian density in (c). Hint: 
both ¢ and ¢' must be included in the sum over fields in (3.46). 


3.27 Prove that the color-singlet states |ud), |du), and |uū + dd) are eigenstates of G 
parity with eigenvalue (—1)’+S+/, where L, S, and I are, respectively, the total orbital 
angular momentum, spin, and isospin. 


3.28 Find the exact domain wall solution to (3.134) corresponding to Figure 3.4 and use 
this to justify that the wall thickness is O(|u|~!). Use the Hamiltonian density in (A.7) to 
compute the energy density (with respect to the ground state) at a distance x from the 
wall, and compute the energy per unit area in the wall. Hint: look for a solution of the form 
g(a) = a tanh(bz). 


3.29 The O(2) invariance of (3.146) is explicitly broken by the mass term in (3.151). 

(a) Identify any unbroken symmetries and describe their consequences. 

(b) Show that charge is not conserved and draw a Feynman diagram that demonstrates 
this, e.g., by allowing the reaction ¢+¢- — tototo, where ¢+ = ($—)! is the particle 
created by øt. Treat the symmetry breaking term as a mass insertion on an internal line. 
(There can also be mixing associated with the external legs, leading to processes such as 
toT — ot¢*. However, the degeneracy of the states for e — 0 makes this case more 
complicated to treat. It is very much like K? — K? mixing treated in Chapter 8.) Note that 
this is a spin-0 analog of a Majorana neutrino mass term. 


3.30 Consider the spontaneously broken theory of a complex scalar field in Section 3.3.3, 
which is described after symmetry breaking by the Lagrangian density in (3.155) for a 
rectangular basis, or by (3.163) in a polar basis. 
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(a) Show that the on-shell amplitudes for ¢{(p1)$5(p2) > $1 (p3)¢4(p4) and n(p1)E(p2) > 
n(p3)&(p4) are the same at tree-level. 

(b) Now consider pseudoscalar-fermion scattering $4(p1)w(p2) > $5(p3)(pa). The Yukawa 
interactions are given for three parametrizations in (3.159), (3.164), and (3.166), and the 
corresponding scalar interactions in (3.155) and (3.163). Show that the tree-level amplitudes 
are the same in all three parametrizations. 


3.31 In theories involving scalars and fermions it is sometimes the case that each mass 
eigenstate scalar field couples either as a scalar or a pseudoscalar to fermions, i.e., that 
there is a conserved parity. The SM and the MSSM at tree level are examples of this. As a 
simpler example, consider a single Dirac fermion w and a single complex scalar ¢, with 


L =i py + (0"9)" (uG) — mbrde — m*brbe 
+ hbrobr +h prd yr -u Hg- roo- rog, 
where u? is real, while m, h, and « are complex. We ignore ¢ self-interactions for simplicity, 
and choose u? > 2|k| > 0 to ensure that ¢ does not acquire a VEV. Consider the cases 
(a) m = k = 0, (b) m= 0,4 Æ 0, (c) and m £ 0,x Æ 0, always with h Æ 0, u? 4 0. In 
each case, find the spectrum, interpret it in terms of the symmetries of £L, and determine 
whether P, C, and CP are conserved. Hint: identify which of the phases in 


vr > erp,  pr> yr ore 


are symmetries and which are field redefinitions. 


3.32 The most general fermion number conserving Lagrangian density for a free Dirac 
field is 

L = pi Pla + byw — ble + iydly, 
where Hermiticity requires a,b,c, and d to be real. The canonical form in (2.152), which 
corresponds to a = 1, c = m, and b = d = 0, is needed to maintain the classical relation 
E? = p° + m?. However, the more general form is acceptable (and may emerge from an 
underlying theory) provided that £ can be put into canonical form by a field redefinition 
prior to quantization. 
(a) Show that for a > |b| > 0 (and all of the parameters real), one can find a multiplicative 
redefinition, Y = Fw, with 


F(a,b,c, d) = Fr(a,b, C, d) + Fs(a, b, C, a), 


such that £(w’) is canonical, i.e., L = Y' (i 9—m)v", with m > 0. Construct F explicitly, and 
find m(a, b, c, d). Hint: write y in terms of its chiral components, Y = YL +r = Prw+Prv, 
where it will be seen that d corresponds to a complex mass. 

(b) Specialize your results to the “wrong sign mass” case, a = 1, c= —|m|, b= d = 0, and 
interpret the result. 


3.33 The potential corresponding to (3.130) can be written for A11 = A22 as 


1 A 2 ANG? ¢ 
V(b1, b2) = 3H (PI + 63) + | (bt + 42) +S: 
which is clearly O(2) invariant for à = 0, but only invariant under sign changes and ¢1,2 
interchange for A Æ 0. Assume that u? < 0 and à > 0. Find the minimum, the physical 
particle masses, and the unbroken symmetries for the three cases X’ > 0, X = 0, and X < 0. 
In the last case impose the necessary boundedness condition. 
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3.34 Consider the potential 


2 
V(¢1, b2, 63) = 5 (9 + $5 + $3) + Lis b1¢2 + His 6163 
+ ås 203 + Aro (01 — $3)? + A3 43. 


Find the conditions for boundedness (assume Ai2 Æ 0, A3 Æ 0), and the special cases for 
which there are flat directions. 


3.35 Specialize the example in Section 3.3.8 to the case Ay = Ayr = —Arrr/2 = A > 0, for 
which the boundedness condition (3.181) does not hold. This is a simplified version of the 
two Higgs doublets in the MSSM, in which the U(1) is a gauge symmetry. 

(a) Find the condition on 4? zz, A, and A for the existence of a minimum of V. Assume that 
A is real and non-negative. 

(b) Show that A > 0 is a necessary condition to have both vı 4 0 and v3 Æ 0. 

(c) For A > 0, what is the condition that the origin (11,3 = 0) is not a minimum? If that is 
the case, is the origin a maximum or a saddlepoint? 

(d) Find explicit expressions for v? = vj + v3 and y = tan7'(v3/v,) in terms of uî rr, A, 
and A. 

(e) Calculate the nonzero eigenvalue py, of u?m in terms of A and y. 

(£) Express the mass-square matrix uke in terms of u4, y and MZ = 2dv? (motivated by 
the MSSM prototype). 

(g) Prove the sum rule p2 + pg = p4 + MZ, where u? „ are the wR, eigenvalues. 

(h) Prove that u2 < M2 cos? 2y and u2 > u3 + M2 sin? 2y, where the labelling is such that 
ia < be 

(i) Show that these inequalities are saturated for large y?,/M#, and interpret this limit. 


3.36 This problem involves a chiral fermion and complex scalar, with an internal global 
symmetry that may be both spontaneously and explicitly broken. Consider the Lagrangian 
density 


L = wi Pv+0,6'0"o— hl[drode + dre'vz] 
—p?o'o— AH)? talo +g), 


where Y = Yr + wp is a fermion, ¢ is a complex scalar, À > 0, h > 0, and a > 0. 

(a) Show that £ has a global chiral symmetry U (1) x U (1) for the case a = 0. 

(b) Calculate the spectrum of the model (i.e., the masses) for the cases (i) u? > 0,a = 0 
and (ii) u? < 0,a = 0. 

(c) Calculate the spectrum for the cases (i) u? > 0,a > 0 and (ii) u? < 0,a > 0. In each 
case assume that V'Àa/|u|3 < 1, and keep only the leading nonzero term in that parameter. 
(d) Interpret the spectrum in each of the above cases in terms of the symmetries and 
symmetry breaking. 

(e) Now add the Majorana mass term —%4(¥7v~% + Yeu) to L, where Yh defined in 
(2.301) is essentially the CP conjugate of Yz. (An analogous term could be added for wz.) 
Show that this violates the U(1) fermion number symmetry. Calculate the divergence of the 
corresponding Noether current. 
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CHAPTER 4 


Gauge Theories 
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In Chapter 3 we considered continuous global symmetries, parametrized by real constants 
B'. In local or gauge symmetries! the 6 are promoted to arbitrary differentiable functions 
b! (x) of space and time. Gauge invariance is sometimes motivated on esthetic grounds. For 
example, why should the phase of the electron field on the Earth be correlated with its phase 
on Mars? This is not entirely compelling because the standard model does involve global 
symmetries (though they may derive from gauge symmetries in an underlying theory). In 
any case, we will take the pragmatic view that gauge invariance is a powerful tool for 
constructing well-behaved field theories and are the unique renormalizable field theories for 
spin-1 particles. In particular, each generator of a gauge invariant theory must correspond 
to an (apparently) massless spin-1 gauge boson, which mediates an (apparently) long-range 
force, and the diagonal generators of an unbroken gauge theory correspond to conserved 
charges (Weyl, 1929). The gauge interactions are uniquely prescribed once one specifies 
the gauge group, the representations of the matter fields, and a gauge coupling constant 
g for each group factor. This approach is opposite to historical development: Maxwell’s 
equations of classical electrodynamics were first derived from observation and consistency, 
and then it was noticed that they were invariant under gauge transformations, i.e., that 
the vector and scalar potential involved redundant degrees of freedom. In this chapter, we 
outline the construction of gauge invariant Lagrangian densities. More detailed treatments 
include (Abers and Lee, 1973; Weinberg, 1973c,d, 1995; Peskin and Schroeder, 1995). 
Let @,, a=1---n, represent n spin-0 or 4 fields that transform as 


B, (x) > (arty) ®ı (1) = [uže] 


P a (2) (4.1) 


under a gauge transformation (generalizing (3.30)). L’, i = 1--- N, are n x n representation 
matrices of the Lie algebra of the gauge group G. Equation (4.1) is equivalent to 
D(x) > 8 (2) = iF a(x) = U(8(z)) O(a), (4.2) 


where ®(z) is an n component column vector with components ®,(). If a theory involves 
ny fermion fields Ya in a column vector w and ng real or complex scalars ġe in a vector ¢, 
then 


V(x) + eF Log (x) = Uy (A(x) ¥(2) 


ee S (4.3) 
plz) > Pe 12 d(x) = Us (B(a)) G(2), 


1 Gauge transformations are often referred to as redundancies rather than symmetries because they refer 
to unobservable degrees of freedom rather than relating different systems. 
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where Ly and Ly are respectively the fermion and scalar representation matrices. In the 
case of a chiral symmetry, 


Li, = Li Pr + LePr with Li, # Lh, (4.4) 
so that 
x(x) > e70 Tiyra) =U; (8(2) ox (a) ae) 
prls) > PO Ryne) = UR(B(2)) dele). 


The transformations of the necessary gauge fields will be detailed below. 


4.1 THE ABELIAN CASE 


As a first example, we take G = U(1). We already considered the electromagnetic inter- 
actions of charged pions and electrons in Sections 2.6 and 2.8, but repeat the key results. 
Under a (non-chiral) gauge transformation 


; ; 1 
b> eB yp, bat > erg t, A, > A, — —0,8, (4.6) 
È 


where w is the electron field, ¢,+ and ¢,- = pla are respectively the + and 77 fields, A 
is the photon (gauge) field y, and e > 0. The gauge covariant derivatives transform as 
Diy = (8! — ieA") y > e ¥O Drp 


l 4.7 
DY bas = (3H + ie A!) bps > et? Deg. an 


Thus, 


7 i 1 2g i ? 
L= PED —m)b + (Dupri)? DY bat — GFF -Wohn A (dhada) » (48) 


where F,,, = ô Av — O,A, is gauge invariant. However, an explicit photon mass term 
Ma A), AH is not gauge invariant, so the y must be massless. The Feynman rules for the gauge 
vertices are given in Figures 2.10 and 2.13. They are unique except for e and the charge 
assignments. However, the mass parameters are arbitrary. Only one pion self-interaction is 
consistent with U(1), but the coefficient À is arbitrary. 

These considerations are easily generalized to an arbitrary non-chiral U(1), with ny 
fermions Ya and ng complex scalars Øp, with charges qa and q; in units of the gauge coupling 
g. (The simple QED example had qe- = —1, qi, = +1, and g = e.) The fields transform as 


. ee à 1 
Va > IPO, dp a POD, An Ay — gute); (4.9) 


where A is the gauge boson. One can think of the charges as the elements of the (reducible) 
diagonal fermion and scalar representation matrices Ly = diag (qı q2°**Qn “) and Lg = 


diag (a qh: a) The gauge covariant derivatives are 


Dh ta = (O" + igqa A") Ya > ÀLE Dlg 


N (4.10) 
D" by = (3! + igq, A”) by > e!%F() DH hy, 
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or in column vector notation, 


Diy = (OHI + ig A" Ly) p > elo Drap 


; 4.11 
D'o = (OHI + igA"Lg) > Eee DHe, oe 
where J is the ny X ny or ng X ng identity. The kinetic terms 
_ 1 
Lo=viPy + (Duo) D" -=F F" (4.12) 
= 4 


gt (9 ,1-igAuLs) 


are gauge invariant. These lead to vertices with the massless gauge boson similar to those 
in Figures 2.10 and 2.13, except e > gq, for œp and —e — gqq for qq. (There are no 
off-diagonal transitions such as bı > wy.) 

It is clear that only gqq and gq, are physical. One can always rescale g provided qa, qj, are 
rescaled accordingly. For example, this freedom is used in QED to set the electron charge 
to —1. In a pure U(1) theory, the relative values of the charges are arbitrary. However, if 
the U(1) is a subgroup of a simple group, then the relative charges are fixed by the higher 
symmetry. 

It is straightforward to extend these considerations to a chiral U (1), i.e., in which the 
L and R charges qar,r of ta are different (this is not the case for QED). Define Ly = 
L,Py + LrPr, where Lz, r are the diagonal charge matrices of Yz, r and Pr, r are the 
chiral projections in (2.196) on page 40. Then, the fermion term in (4.12) becomes 


viDy =o (i Pl — gALy)b =o (i pI — gA[Li Py + LePr)) Y, (4.13) 
and the pa vertex in Figure 2.13 is 


i daL + VaR daL — TaR 
—igYu ( 5 ) ( 5 ) a : (4.14) 


One can add mass and additional non-derivative interaction terms to Lo provided they 
are invariant under the global U(1). For example,” 


L = Lo — damasts — bh uz aba 
+ [BaP Syvude + Baita obe + hic] + Aaea ohea 


(4.15) 


would be invariant provided qa = qe for nonzero Map, ql, = qh for nonzero uĉa, da = œ + q! 
for nonzero TS, or TS,, and q4 + q, = q + q} for nonzero Aabed- 


4.2 — NON-ABELIAN GAUGE THEORIES 


Now consider a non-abelian gauge symmetry (Yang and Mills, 1954), in which the spin-0 and 
4 fields transform according to (4.3), where G is a simple group. Any mass terms, Yukawa 
interactions, or non-derivative scalar self-interactions that are invariant under the corre- 
sponding global symmetry are automatically gauge invariant as well. However, the fermion 
or scalar kinetic energy terms contain derivatives and are therefore not gauge invariant, 


OY — alt [u (Ba = Ud" + [a"U] ð. (4.16) 


2In (4.15) the matrix u? must be Hermitian, and there are constraints on the abea from Hermiticity and 
Bose statistics. mg, is assumed Hermitian, but non-Hermitian fermion mass matrices can be introduced by 
rewriting in terms of Yz, r (or allowing 7° terms). Other types of cubic or quartic scalar self-interactions 
could be allowed if they are globally invariant, such as ¢adpgc + h.c. for q4 +q, + q4 = 0. 
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One must therefore introduce a gauge covariant derivative 
OY > D! = 0" + ig AP - L, (4.17) 


i 
where Aj, 


(real) gauge coupling, A”. E = AL‘, and L’ are the representation matrices for ®, i.e., 
L’ = L}, Li,, or Ly, Pp + LPR. Of course, 0" = OMI, where I is the n x n identity matrix. 
In terms of components, 


i = 1---N, are real vector gauge fields (one per generator), g is an arbitrary 


(D"®), = [a das + igA” - Lal Do. (4.18) 


These gauge covariant derivatives specify the interactions of the spin-0 or spin-4 fields with 
the gauge bosons in terms of a single gauge coupling g (or one per group factor for a non- 
simple group), once the group and representations are specified. The Feynman rules for the 
gauge interactions are shown in Figure 4.1. For example, the fermion kinetic energy term 


LKE, = vido (4.19) 
implies that the amplitude for y, to absorb or emit gauge boson A’ and turn into Ya is just 
g l1- ee cee a i 

—igy" Lab + Lab 7 Coa igg" Lap: (4.20) 


This must be sandwiched between appropriate u or v spinors and contracted with a gauge 
polarization vector, as described in Chapter 2. Unlike the abelian case, the non-diagonal 
generators imply transitions between the different members of the fermion (or scalar) IRREP 
that are related by the symmetry. 

Similarly, the kinetic energy term for a complex scalar representation becomes 


(D6)! Dub = (9)! Oud — igAi, (GV DELG) + gA A! GLL G. (4.21) 


The second term on the right yields the three-point vertex in Figure 4.1 since 0“¢ > —ip” o 

( 04g! — +ip” ot), where p is the momentum entering (exiting) the vertex for ¢ (@"), i.e., 

p always flows in the direction of the arrow. The four-point vertex from iLx« A follows from 

the last term, taking into account that there are two ways to contract A‘ AJ, with the 

external gauge fields. The vertices for Hermitian scalars are considered in Problem 4.6. 
We must still determine the transformation of AŻ. The first requirement is 


D! > UD”. (4.22) 
This implies that the fermion and scalar kinetic energy terms 
L = pi He)! 
> PiU} Up Pyh + (D5) ——-ULUsDond (4.23) 
a 
ot (Bur—igãn- E4) 


are gauge invariant, since UTU = I. The necessary transformation is given by 


pbs A SA bh 4 (3 U) UTE, (4.24) 
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a 
igy" Li ap = —igy" [Liab PL + Lira PR] 
i, 
b 
a, Pa " F = = i 
`, —ig(Pa + Po)” Lap 
~~ is H 
{í 
+ 
b, Pp 
apa , iu ig?g” (L414, + LAE) 
y% ab 
aA 
b, Pp ’ j, v 
HEL p tgcigr l9" (q — p)? + 9%? (p — r)” 
+9"? (r — q)"] = gcijk CY’? (p,q,r) 
k, o 
r 
. q 
J, V 
i, u l, p 
’ , = ig? CijmCkim( gI g”? p g"? g9) 
— ig? CikmCjim( g” g — gt’g’?) 
— ig? CilmCjkm h(g” 9°" — g7 g’) 
jv k, o 


Figure 4.1 Feynman rules for the interactions of gauge bosons with fermions, scalars, 
and gauge self-interactions. The scalar vertices apply to both complex or Hermitian 
scalars. pa and pp in the second vertex are the momenta flowing in the direction 
of the arrows, which in some cases are the negative of the physical momenta. For 
example, Pa = —Pa if the a is twisted downward to represent an incident antiparticle 
or Hermitian scalar, while pp = —p» for an outgoing antiparticle or Hermitian scalar, 
as in Figure 2.10. Recall that Liab = — Lin, for a Hermitian scalar. In the triple 
gauge vertex the momenta p, q, and r all flow into the vertex and satisfy p+q+r = 0. 
The function C“”? (p, q,r) is totally antisymmetric if the indices and corresponding 
momenta are both interchanged. 
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where L’ is any nontrivial IRREP, such as Li, or L4, and U = U(x) = eP) L, The first 
term on the right is the expected transformation for an adjoint representation. It would 
be present even for a global transformation, as in (3.32) on page 97. The second term 
generalizes (3.32) to a gauge transformation. One has 


> 


- TGA) L) 
At = —__# 7 4.25 
where the Dynkin index T(L) is defined in (3.8). It can be shown that A‘/ is independent 
of the representation L. It is then straightforward to establish that (4.22) holds. For small 


= 


B(x), (4.24) reduces to 
1 À 
Ali = AÑ, — cijkbpI A — gone (4.26) 
where the first term represents the global transformation of an adjoint field, and the second 


is reminiscent of abelian gauge transformations. 
The gauge boson kinetic energy term is 


1: : $ ; ‘ ; 
E i iuv : t — i i k 
LKE, = aw P with Fi, = ô Ap — OvA,,— Icijk Al Ap - (4.27) 
ee eee 
kinetic energy self-interactions 


Unlike the abelian case, the field strength tensor F i contains a quadratic term proportional 
to the structure constant, i.e., the non-abelian gauge bosons are themselves charged. This 
leads to 3 and 4 point self-interactions, as shown in Figure 4.1. To see the gauge invariance 
of Lez, it is convenient to rewrite 


LKE, = -r (Ee I) Í (4.28) 
where ki T DOSE a Cea a 
F,,-£=0,A,-L-a,4,-E+ig |2, 2,4,- E]. (4.29) 
One can then show that 
FR, L> Fp L=UF,-£U-}, (4.30) 


so that (4.28) is invariant. However, an elementary gauge boson mass term į M2 A‘ Al, is 
not invariant, establishing the statement that there is one (apparently) massless gauge boson 
and the associated long range force for each gauge generator. Altogether, the Langrangian 
density for a gauge theory of fermions and complex scalars is 


E T ij 
L =F E + pid + (D5) Den +: (4.31) 
Scalar or fermion mass terms, Yukawa couplings, and non-derivative scalar self-interactions 
consistent with the global symmetry can be included. 


Direct Product Groups 


The formalism can easily be extended to the case in which the gauge group G is a direct 
product of two or more simple or abelian factors, e.g., G = G1 x G2. The gauge covariant 
derivatives in (4.18) for scalar or fermion fields become 


D'o = [a + ig, AY - Ey + igo At - Lo) ©, (4.32) 
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where gm, Åm, and Drs m = 1,2, are, respectively, the gauge coupling, gauge bosons, and 
representation matrices for Gm. Similarly, the gauge boson kinetic energy terms become a 
sum over those for the group factors 


1 


1 i iuv i tuv 
LKE, = g imt m gimt" ` (4.33) 
The field strength tensors are unchanged, 
Figo = Oy Arie _ cre eam ~ Im Cmijk Af p Ahy. (4.34) 


where Cmijk are the structure constants for Gm. Thus, the fermion and scalar 3-point vertices 
in Figure 4.1 apply separately to G1 and G2. The same is true for the gauge self-interactions, 
which do not connect the gauge bosons of different group factors because they commute. 
However, scalar multiplets that transform under both groups lead to mixed seagull diagrams. 
The Feynman rule for the mixed ¢/,¢) Aj" Aj” vertex is 


2ig1 92g” (ri oi 3 zi (4.35) 


4.3 THE HIGGS MECHANISM 


We have seen that gauge theories do not allow elementary mass terms for gauge bosons, 
because they would break the gauge invariance and spoil the renormalizability. This ap- 
pears problematic for the weak interactions, which are short-ranged and require massive 
mediators. 

Another potential problem, for theories with spontaneous symmetry breaking, is that 
they imply massless Goldstone bosons, but there are no known exactly-massless Goldstone 
bosons associated with the elementary particle interactions. Fortunately, when G is a gauge 
symmetry the two problems of the unwanted Goldstone bosons and the unwanted massless 
gauge bosons can cure each other when the symmetry is spontaneously broken (Anderson, 
1963). A particularly simple implementation is the Higgs mechanism, involving elementary 
spin-0 fields (Higgs, 1964, 1966; Englert and Brout, 1964; Guralnik et al., 1964). Instead of 
existing as a massless spin-0 particle, the degree of freedom carried by the Goldstone boson 
manifests itself as the longitudinal spin component of a gauge boson, which has in the process 
acquired a mass. (One says that the Goldstone boson has been “eaten.” ) Remarkably, SSB 
via the Higgs mechanism preserves the renormalizability of the theory (’t Hooft, 1971a; 
’t Hooft and Veltman, 1972; Lee and Zinn-Justin, 1972, 1973). 

To illustrate this, consider a U(1) gauge theory with a single complex scalar field ¢, with 


L= =i + [(Ou + ig Ay) 6]! (O" + igA")$ — BHO- AlO)? (4.36) 


as in (2.133) or (4.8). We studied the analogous problem of a global U(1) symmetry in 
Section 3.3.3. It was found that for u? < 0, ¢ acquired a VEV, 


ae me oe 
e= F (4.37) 


Expanding ¢ = (v +0 + ix)/ V2 around v/v2, there was one massive physical scalar 7 and 
one massless Goldstone boson x. 
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The minimization of the potential for a gauge symmetry is the same as for the global 
case, and (4.37) continues to hold. Rewriting (4.36) in terms of the new fields, 
1 yl gv? 
L=- i M ee z nx)” + Ty AA" + guA,O"x 


2 
+ TAA" [2v0 +0? + x7] + gA” (oax) (4.38) 


1 2_ Heo 2 2 À, 2 2)2 
+ ~(0,0) ao” — Avo(o" +. x7) — —(0° + x"*)*, 
2 2 4 
where u2 = —2y? and the constant term in (3.155) is not displayed. We see that in addition 
to the kinetic energy terms for the gauge fields and for o and y there are two new quadratic 
terms. The third term in (4.38) has the form of a mass term for the gauge field with mass 
gv. This mass can be interpreted as arising from the interaction of the gauge field with the 
condensate of Higgs fields. The fourth term in (4.38) is proportional to the gauge field times 
the derivative of the Goldstone boson field x. To interpret it, we can recombine the x and 


A terms as 
22 


v 
E f Gy, ASA, ax, (4.39) 
suggesting that 0,x/gv is the longitudinal component of a now massive vector field A’,. 
However, x still enters the cubic and quartic terms in a way that is hard to interpret. 

To see what is going on it is useful to use the Kibble reparametrization (Kibble, 1967), 
which makes the physical particle content manifest and was already introduced for the 
global case in Section 3.3.5. While working at the level of the classical field theory, one can 
define new fields 7, € related to o and x by a non-linear field redefinition: 


vy+tot+ix 
V2 
For small ø and x one has approximately 
y+ 1) yt+tn+i€ 
v2 v2 


so that 7 ~ o and £ ~ y. The fields 7 and € therefore represent the massive and Goldstone 
scalars, respectively. In terms of the new fields, 


ae = exp(ié/v) (=) , (4.40) 


v2 


expli£/v) ( (4.41) 


1 ge nj? 1 iG À 
— =p" pe! A’ Iu | | 2 N n2 3 4 : 
where Ant 
A = Ap + Fig = LA, — GA, (4.43) 


gv 
which is clearly the Lagrangian density for a massive vector field, including 3- and 4-point 
gauge and self interactions for the massive scalar 7. The Goldstone boson has disappeared 
from the theory. 
One can interpret (4.42) as the result of choosing a special gauge. Before quantizing one 
can make a U(1) gauge transformation? as in (4.6), choosing (x2) = —€(x)/v. Then 


OR 


gv 


y+” 


V2 


3€ — ¿' = € + B(x)v, which is known as a shift transformation. 


olx) > p' (x) = eE g = ( ) , A, (x) > Al, = A, + (4.44) 
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Equation (4.42) then follows from (4.36) by gauge invariance, £(A,¢) = L(A’, ¢'). This 
unitary gauge is useful because it makes the physical particle content of the theory manifest. 
However, it is not so useful for explicit calculations, especially in higher orders where it is 
rather singular. 

The number of degrees of freedom was not changed by the Higgs mechanism. Before 
SSB there were two massless gauge degrees of freedom and two Hermitian scalars, while 
afterwards there are three gauge degrees of freedom and one massive real scalar. 

Now let us consider the non-abelian case with an arbitrary scalar sector. Just as in 
the general discussion of the Nambu-Goldstone theorem, it is more convenient to choose a 
Hermitian basis for the scalars for formal manipulations, though a complex basis may be 
simpler for concrete calculations. We assume there are n Hermitian scalar fields ¢,, which 
can be arranged in a column vector ¢ = (ġ1 Øn)? with VEV v = (0|¢|0). As discussed in 
Section 3.3.6, we assume that M of the generators are not broken, Liv = 0, i = 1--- M, while 
the remaining N— M are broken, i.e., L'v 4 0, i = M+1---N. Then, for a global symmetry 
we expect that there will be N—M massless Goldstone bosons and p = n—(N—M) massive 
physical scalar particles. According to (3.173) the Goldstone bosons are linear combinations 
of the original Hermitian fields, corresponding to the directions of the massless eigenvectors 
iLîv, i = M+1---N. These span an N — M dimensional vector space, but need not be 
orthogonal. (The i is because the L’ are imaginary). We can therefore label the n scalars 
so that 


vı +01 vı +M 
o= aes = eli Eim FL] | YP : w |= ely +7), (4.45) 
p 
Xn 0 


in analogy with (4.40). The n — p = N — M fields y; in the first form are associated with 
the subspace spanned by the iL'v. There are no VEV’s in those directions: we are working 
in a Hermitian basis, which implies Lê = —L’? = —L** and therefore that (v|Liv} = 0. 
The p fields o; will be associated with the massive scalars. Not all of the v;, i = 1---p, 
are necessarily non-zero. The second form is a polar reparametrization. Only the broken 
generators are included in the exponent, and the € are the Goldstone boson fields. Unlike 
(4.40) we have not attempted to normalize the €* by factors of v, as it is not needed for a 
gauge symmetry in unitary gauge.* 

The Goldstone fields may then be removed by going to the unitary gauge, i.e., the 
Kibble transformation, just as for the U(1) example. Gauge invariance ensures that £ will 
be unchanged in form under the gauge transformations in (4.2) and (4.24). In particular, 
choose pt (x) = —€(x), i= M+1---N, and 8*(x) = 0 otherwise. Then, L(A, ¢) = L(A’, g’), 
where ¢! =v +n. A’ is given generally by (4.24), but for small £ is 


; ; : 1 ; 
Ali = Al + ijk AE + gone (4.46) 


which contains the longitudinal term 0,,¢’. The relevant term for our present considerations 
is the gauge covariant kinetic term for the Hermitian scalars. This becomes, in terms of the 


“For a global symmetry the € fields would have a non-canonical kinetic energy matrix, and field redefi- 
nitions similar to those in Section 2.14 to restore the correct diagonal form 4 (OnE)? would be required. 
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new variables, 


1 1 = Pe eee ee: ee 
5 PudDho > 3U +0)" (On — igA" -L)( Oy + igAy- L)(v +n) 
1 imag a d irj iu gj 
= zM5A PAI + zPunD"n +9? (TL Lin) AA, 


(4.47) 


where the primes have been dropped for notational simplicity. The Goldstone fields € have 
disappeared from the theory, reemerging as the longitudinal gauge field components, and 
the gauge bosons corresponding to the broken generators have acquired mass, described by 
the first term in the second line. The second term is the normal gauge covariant kinetic 
energy for the physical 7 fields, which include the three and four point gauge interactions, 
and the last is an induced cubic gauge interaction. 

The gauge boson mass matrix in (4.47) is 


Mj, = M}, = 9? v7 L' Liv 
9 be is 4.48 
= glv] Li |v) = 9g (L'v|Liv) = g? Lvi (L v)a, a8) 
g g a 
a=p+1 


where the inner product is defined by (xz|y) = X v%yq. The induced cubic term in (4.47) 
may be similarly rewritten as 


g (vT L'L’n) Ait AI, = g°(Liv| Lin) AAT = g° (v| Lİ Lin) Att AI, 
A IA AAT (4.49) 
=9 X` (Lvi n)a AHAI. 


a=p+1 


It is easy to prove, using the explicit form in (4.48) and the fact that the representation 
matrices L’ are antisymmetric and purely imaginary, that M? is real, symmetric, and has 
non-negative eigenvalues. From (3.174), M? must have the block-diagonal form 


M? = ( ; p i (4.50) 


where the upper diagonal block is M x M dimensional and M? is (N — M) x (N — M) 
dimensional. There are therefore M massless gauge bosons A!---A™, corresponding to 
the unbroken generators, and N — M massive gauge bosons, corresponding to the N — M 
non-zero eigenvalues of M?. The N — M Goldstone bosons have been eaten to become the 
longitudinal modes of N — M massive gauge bosons AM*+!... AN, 


4.4 THE R; GAUGES 


The unitary gauge for a spontaneously broken theory, introduced in Section 4.3, is extremely 
useful for identifying the physical states of the theory, but it is not very convenient for 
explicit calculations, especially when higher-order loop corrections are involved, because 
it is very singular. It is useful to work instead in a new class of gauges called the Re 
gauges (Fujikawa et al., 1972; Weinberg, 1973c,d; Lee and Zinn-Justin, 1973), which are 
less singular, though the particle content is less obvious. They are therefore better behaved 
in higher-order calculations and for proving the renormalizability of spontaneously broken 
gauge theories. 
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Consider a gauge theory with n Hermitian scalars ¢,,a = 1---n, arranged in a col- 
umn vector ¢, and m fermion fields represented by a column vector w. The most general 
renormalizable Lagrangian density with a conserved fermion number is then 


I fos 1 = = 
L= -3E FY + 5(D"6)(Dud) -V (6) +PP — mob + poa (451) 
where Fi, i = 1---N, are the field strength tensors for the gauge group G, D, and P are the 


covariant derivatives for the scalars and fermions, and V is the scalar potential containing 
gauge-invariant terms up to order ¢*. There may be present a fermion bare mass term 
described by the m x m matrix mo = moL PL + morPr, with mo, = mip, if it is allowed 
by the symmetries of the theory. Similarly, there will in general be Yukawa interactions 
between the fermions and scalars described by m x m matrices I° = [¢ Pr + %Pr, with 
=r. 

Just as in Section 4.3, we define a column vector v = (0|¢|0) of VEVs, where va = 0 
and some (L'v), Æ 0 for a = p+1---n, where n — p = N — M is the number of broken 
generators. In an arbitrary gauge one can write ¢ = v+¢’, where ¢’ is the shifted field with 
(0|¢"|0) = 0, 

vi + Qi 
p=v+g = : ; (4.52) 
Un + bn 
We saw in (4.45) that ¢!,, a = 1---p, represent physical scalars, while ¢/,,a = p+1---n, 
are associated with the Goldstone degrees of freedom. The latter disappear in the unitary 
gauge, which can be defined by the condition 


(L'v|¢') =0, i=1---N. (4.53) 


In a general gauge, ¢’ will include the unphysical Goldstone bosons. Similar to (4.38) we 
can rewrite (4.51) in terms of the shifted fields 


L= [FIRM 4 DPED g) 


4 o 
Mọ min ; ; E 4.54 
+ ae uae — ig(v|L'O"d') Al, + g? (v|L' Lig!) At AI ( ) 
—V(v +d) +YP — mo +T vay + ood, 
where L = Lg = —L” are the representation matrices for the Hermitian scalars. The terms 


in the first line represent the gauge and Higgs kinetic energies and gauge interactions, just 
as in the non-spontaneously broken case (Figure 4.1). Both the physical and the Goldstone 
boson states are included. The first term in the second line is the induced gauge boson mass 
term. It is of the same form as in the unitary gauge, with M? given in (4.48). The next 
will be cancelled by terms added to £ to fix the gauge, and the third is the induced cubic 
interaction, leading to the ġ, A1,A7, vertex 


ig? QuuVa (LiL) + LAL )ap (4.55) 


shown in Figure 4.2. In the last line, V becomes the scalar potential for ¢’, including mass 
terms for the physical states and scalar self-interactions. The n x n dimensional mass- 
squared matrix fi? is given by (3.169), just as for the case of a global symmetry. It has 
p (generally) nonzero eigenvalues corresponding to the physical scalars, and n — p zero 
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eigenvalues corresponding to the Goldstone bosons. The next term in (4.54) includes the 
fermion mass matrix 


m = mo — Iva = MmLPL + MRPR, m, = mh, (4.56) 


which may in general have both bare (mo) and spontaneously-generated pieces. m may 

involve y°’s unless mz = mr. Techniques for “diagonalizing” m to obtain the fermion mass 

eigenvalues and eigenvectors are described in the standard model context in Section 8.2.2. 

The Yukawa interactions involving the shifted fields are given by the last term in (4.54). 
The Re gauges are defined by the condition 


F' (A(x), (2), £) = a"(2), (4.57) 


where 


Fis VE lanai, + FAA), (4.58) 


€ is a real parameter which runs from 0 to oo and specifies the gauge, and a‘(x) is a 
number that depends on z only. a’ is averaged with an exponential weighting factor in the 
quantization procedure, so its precise value is unimportant. The limit € — 0 corresponds to 
the unitary gauge. 

The quantization procedure in the R¢ gauges introduces additional terms into the effec- 
tive Lagrangian density that modify the structure of the vector and scalar propagators. The 
Goldstone bosons have not been removed from the Lagrangian, and occur in internal lines 
in Feynman diagrams. They do not occur as external states (except in connection with the 
equivalence theorem, introduced in Section 8.5.1). The quantization also introduces terms 
that can be represented by a set of N ghost fields m, i = 1,---,N. These are fictitious 
particles that do not correspond to physical states but do circulate in internal loops. They 
are needed to ensure unitarity and renormalizability, and can be treated like complex scalar 
fields except that they obey Fermi statistics. The ghost vertices are given by the effective 
interaction 5 

L ghost = g(O"'n! ein nj A + ntm (v| L’ L? o) (4.59) 
and are shown in Figure 4.2. There is a factor of —1 for each closed ghost loop. ni and 7; are 
to be viewed as independent anticommuting c-numbers, not necessarily related by Hermitian 
or complex conjugation. The notation emphasizes that there is an effective propagator 
iDg(a — x’) = (0|T (n(x), n*(x’)]|0). Clear discussions and derivations of the formulae may 
be found in (Weinberg, 1995; Pokorski, 2000). 

The momentum space propagator for the gauge bosons in an arbitrary Re gauge is 


kekY(1—1/€)] 1 
k? — M?/€ | k2 — M?’ 


iD (k) = -i o” (4.60) 


which is an N x N matrix in the space of gauge indices. For practical calculations it is 
often convenient to rewrite this (within the subspace of broken generators) as 


an g RER” 1 i k” k” 
iD (k) = —i (o WE ) POM MR ME (4.61) 


The € dependent piece, which will ultimately cancel other € dependent terms in the Higgs 


5There is an implicit +ie in each propagator, e.g., 1/(k? — M?) > 1/(k? — M? + ie). 
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o Fa H i, K i. p 
`S Fi \ Y. Pi 
pa nannan k, u KiE v (LiL ap 
ON d —IPuCijk P 
b j, v j j b 


ig’ gJuvVal L LÍ + DL) 


Figure 4.2 Left: Induced three-point vertex between one scalar and two gauge fields, 
derived from a four-point interaction with one external scalar replaced by its vac- 
uum expectation value. Vertices for ghost interactions with vector (middle) and 
scalar (right) fields. The dotted lines represent ghost propagators. Ghost-ghost- 
vector vertices involve one outgoing and one incoming momentum, with the outgo- 
ing one appearing in the vertex factor. (Some authors place a dot near the outgoing 
line to indicate which momentum appears. ) 


and ghost propagators, is singled out. The ghost propagator, which is also an N x N matrix, 
is ; 

—i 
i? MJE 
The Higgs propagator is an n x n matrix in the space of scalar field indices. It can be written 
as 


iDg(k) = (4.62) 


a a 
iAg(k) = (I H ; 4.63 
iAg(k) = (1 - P) a + Pisa ses 
where P is defined as the projection operator onto the N—M dimensional space of Goldstone 
fields spanned by the vectors iL’v, i = M +1---N. It is given explicitly by 


; 1 ; 1 ; 
— p2|ri EE T E * 
P= Pr (sp), CS Pa =E (Ge) Es, ae 
where i 
0 0 
— = 4. 
w=(o g) a 
is the inverse of the vector mass-squared matrix in the subspace of massive vectors (and 
zero otherwise). The scalar propagator in (4.63) decomposes into two pieces. The first, 
proportional to I — P, represents the propagation of real physical scalars with mass-squared 


matrix u2. The second term is the Goldstone boson contribution, which is present in an 
arbitrary € gauge. It is shorthand for 


[Pair] = 7: arr (o (4.66) 


Using Pu? = 0, (4.63) can be rewritten 


i ig? i . 
iasta = [a] + aT o. (Lins, (467) 
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where the first term includes both the physical and Goldstone scalars. 

We see that the propagators of gauge, ghost, and Higgs fields can be decomposed into 
pieces involving the propagation of physical particles, namely the first terms in (4.61) and 
(4.63), as well as pieces that involve unphysical poles at k? = 3E that depend upon the 
gauge. These unphysical poles do not correspond to physical particles, and are included 
only in internal lines in Feynman diagrams. They cancel in the final expressions for physical 
on-shell amplitudes, i.e., S matrix elements, which are necessarily gauge invariant. 

Re gauges for € # 0 are referred to as renormalizable gauges because the propagators 
are well-behaved for k? — oo, falling as 1/k?. In principle it is best to do calculations with € 
arbitrary so that (a) the Feynman diagrams are well-behaved and manifestly renormalizable, 
and (b) so that the cancellation of the € dependence in the physical S matrix elements can 
be used as a check on the correctness of the calculation. In practice, however, it is messy to 
carry out calculations for an arbitrary €, and therefore people often make use of particular 
gauges for calculations or formal arguments. The € = 1 gauge is known as the ’t Hooft- 
Feynman gauge. The gauge propagator takes the particularly simple form 

igt” 
iDY = a (4.68) 
so the € = 1 gauge is often convenient for carrying out concrete calculations. Another useful 
gauge is € + oo, known as the renormalizable or Landau gauge. The propagators again take 
a relatively simple form in this limit 


i (gt ~ 4) 


k2 — M2 


PGR . _ i ; E i 
iDọ (k) = , %tDelk)= Re iAg(k) = Poe (4.69) 
but this gauge still contains ghosts and unphysical scalar fields. 

The above gauges are best for calculations of higher orders. However, the € —> 0 or 
unitary gauge is particularly simple for displaying the physical particle content of the theory. 
The propagators become 

i v k” k” ° 
—i (g"” — ; : i 
mi diia — a ) , iDg(k)=0, iAg(k) =(1- Pap a 


ID (k) = (4.70) 
That is, there are no ghost fields and only the physical, non-Goldstone scalar fields survive. 
If one is only interested in calculating at tree level, the unitary gauge is very convenient. 
However, the gauge boson propagator is badly behaved as k — oo; it approaches a constant 
rather than falling like 1/k?. It therefore induces severe ultraviolet divergences in higher- 
order calculations that must be handled very carefully. 

The ghost fields do not entirely disappear from the theory in the unitary gauge (Wein- 
berg, 1973c,d). There is an effective multiscalar interaction 


Ly = —i6*(0)Tr In(I+ J), (4.71) 
where 
Ig =9 (4) E) (4.72) 
K M? ik i l 


The trace and matrices in (4.71) and (4.72) are restricted to the N—M dimensional subspace 
of broken generators of G. Lg is a remnant of the ghost loops that survives as € —> 0 because 
of the factors €~! in the ghost-ghost-scalar vertices, which cancel the zeroes in the ghost 
propagators. Lg is necessary to cancel divergences due to gauge boson loops in multiscalar 
interactions. 
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Complex Scalars 


It is straightforward to reexpress the results for the interaction vertices and propagators in 
a complex scalar basis, either by “starting from scratch” or by using the formal results in 
(3.71)—(3.73). Writing ¢ = v + ¢’, where the fields and VEVs are now complex, the induced 
cubic and ghost-Higgs vertices in (4.54) and (4.59) are replaced by 


Laag = 9 [WLL g) + (9L Lv] A, AT 


Ling! = gri [(v|L Did’) + (JL vy] , 
respectively, while the gauge boson mass matrix in (4.48) becomes 
2 2 ipi iri 
Mj = g° (wL L + LAL" w). (4.74) 


The projection operator P onto the Goldstone subspace is usually obvious, but is best 
worked out in the Hermitian basis for complicated cases. 

Let us illustrate the Rẹ constructions for an SU (2) gauge theory involving a single 

complex scalar SU (2) doublet. (This is a simplified version of the standard SU (2) x U(1) 
model with the U(1) gauge coupling, and therefore electromagnetism, turned off.) The 
Hermitian SU(2) gauge fields Aj,, i = 1---3, will sometimes be re-expressed as 
Al, FiA? 
A= ra ASA. (4.75) 
where A+ = (A~)!, and the labels + and 0 are suggestive of the electric charges that will 
emerge when the model is promoted to SU(2) x U(1). The gauge self-interactions for the 
A fields are derived in Problem 4.8. Similarly, the scalar doublet can be written 


o=(% ), a= (% ), (4.76) 


where the conjugate fields are defined as ¢~ = (¢+)' and $°* = (¢°)'. The most general 
renormalizable gauge invariant potential for ¢, 


V(b) =+ H h +A H)’, (4.77) 


is identical to the Higgs potential in the standard model, and will be described in more 
detail in Section 8.2. Here we note that \ > 0 is needed for vacuum stability. For u? > 0 
there is no SSB, i.e., v = (0|¢|0) = 0, while v 4 0 for u? < 0. 

In the unbroken case, v = 0, the complex scalar fields ¢* and ¢° are degenerate with 
mass u, and their self-interaction terms are 


Ly = -Vr = -AH p)? = -AGTH + ogy. (4.78) 


The scalar gauge interactions® can be obtained by writing the general form in (4.21) in 
terms of components, using L’ = 7t /2 and Atr = A973 + J/2Atrt+ + V2A-77: 


— if (g Feot — gag) 49 — -2 (go Beet) A- 
Le = iZ (o Drot —¢ org) A9 ise det) A; 
_ «9 (a-Bvge\ at 4H d-at 4 6G) de a 
iia (9 4d") Ak + Fd ot + gO) AM Ay, 
6The quartic term is the product of the SU (2) singlets Av Ay and @~ et +4 ¢° = ¢t¢. This is because 


the product of two doublets transforms as 0+ 1 under SU(2) while that for two triplets is 0+1+2. However, 
the 1 is antisymmetric for the triplets and vanishes in this case, so only the singlet components can enter. 


(4.79) 
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where A”. A, = 2AtHAT + AAD. The diagonal quantum number associated with T? is 
conserved. 

For u? < 0 the SU(2) symmetry is completely broken. Similar to the U(1) example in 
(3.153) on page 119 the minimum occurs for v = v/V2, where |v|? = —p?/. Without loss 
of generality, we can choose v to be real and in the ¢° direction: other orientations of the 
minimum differ by SU(2) transformations. Thus, we can write 


v=z()) 6=( ae ) (4.80) 
~ a p ; = ee iz ; . 


where H and z are the Hermitian components of ¢” and wt = ¢*. Rewriting the potential 
in the new variables, 


A 
V($) = PH? + WwH [HP +2 + 2wtw7] +F [H? +2? + 2wtw 7], (4.81) 


where v = ,/—y?/X and we have dropped the additive constant. We therefore recognize 
that H is a physical scalar field with mass My = \/—2u? = V/2\v, while z, wt, and 
wT = (wt)! are the Goldstone bosons that should disappear in the unitary gauge. (The 
notation is chosen to coincide with the analogous standard model case.) The second term 
on the right is an induced cubic interaction. 

In the notation following (3.71) on page 101, ¢ can be written in a Hermitian basis as 


w1 0 
v+ H v 

Ph = nie = Vh + Oh, Vh = 0 5 (4.82) 
z 0 


where wt = (w! + iw?)/v2 and the ordering of the components differs slightly from the 


convention in (4.52). Then, using the SU(2) representation matrices in (3.76), 


i 1 
1 2 
L} Vh = ; Liv, = Liu, =} 


5 (4.83) 


2 


oy; oo 


0 
ij 0 
2| 0 
v 


NI 
(= OS = 


establishing that all three generators are broken and that the vectors iL} vn» span the Gold- 
stone subspace. 
Substituting the representation in (4.80) for ¢, the scalar gauge interactions become 


1 a > 4> 
Lo =5 Ma AY -Au + 7 (—iw- ð wt +2 aH) Al, 


= = 

= iS CS iz] dwt) An is (w7 d(H + iz) Ay (4.84) 
2 2 2 2 

+ Donde. 4,4 (out 2 F*) Ae. A, 


where we have omitted the irrelevant ¢’ — A mixing terms. The three gauge bosons A? (or 
AF, A?) have acquired a common mass 


Ma = . (4.85) 
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by the Higgs mechanism. They are degenerate because the Lagrangian has an unbroken 
O(3) global symmetry after the breaking, as is discussed in Section 8.2.2. The derivative 
cubic terms always connect H to a Goldstone field (or two Goldstone fields to each other), 
so they disappear in the unitary gauge. (There are important analogs involving physical 
Higgs fields in models with extended Higgs sectors, such as in the minimal supersymmetric 
extension of the standard model (MSSM).) The induced cubic term vH A? involves only 
the physical scalar field. The projection operator P in (4.63) projects onto w+ and z, while 
I —P projects onto H. 

As a simple example, let us consider the amplitude for HH —> At A™ in an Re gauge, 
with the tree-level diagrams shown in Figure 4.3. Using the gauge and scalar propagators 
in (4.61) and (4.63), and the interaction vertices in (4.81) and (4.84), the amplitude is 


pee oat 
Oe. sani igv’\ . 
M= (=) 63, LD. (ps — p1) (=) eq, + (pi © p2) 


+ [ (=) eb“ [ilps — Pi) yp + tip] tDw+ (p3 -»s) 


: (4.86) 
(+19 \ very. . 
x [ (+) €4"[+i(pe — pa)v 4 inn + (pi + p2) 
i ee . 
+ (=) €3°€4tDy(pi + p2)(— iàv)3!, 
where the 3! in the last term is combinatoric. The propagators are 
ktk” 1 i k” k” 
iDH (k) = —i (a ) 
At M2 k2 — M2 M2 k2 — M2 
5 A A is (4.87) 
i i 
iD,,+(k) = =— iDy(k) = =. 


Using (4.85) as well as p3 - eš = p4- ej = 0, the wt exchange term cancels the €-dependent 
part of the At term, leaving the At and H exchange diagrams evaluated in unitary gauge. 
This illustrates the general result that physical on-shell amplitudes are independent of €. 


At (ps) A’ (pi)  A* (ps) A` (Pa) = -A* (ps) A’ (Pa) 
r S i F ye 
i = 1 = : 
ee ee H 
r A 4 e wt 4 L 
H(p) H (P2) (pi) H (p2) H(p) H (p2) 


Figure 4.3 Tree-level diagrams for HH —> ATA" in an Rẹ gauge. There are two 
additional u-channel diagrams obtained from the first two by pı © pao, 


4.5 ANOMALIES 


Anomalies refer to quantum effects that break the symmetries associated with the classical 
equations of motion. In particular, they may occur when the diverences in a theory cannot be 
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regularized in a way that is consistent with the original symmetries. The Adler-Bell-Jackiw 
anomalies (Adler, 1969; Adler and Bardeen, 1969; Bell and Jackiw, 1969; Bardeen, 1969; 
Adler, 2004) are singularities associated with the fermion triangle diagram contributions to 
the vertex of three currents, as shown in Figure 4.4. If one or three of the vertices involve 


J 
i a 
ac] 
J 


Figure 4.4 Left: The anomalous triangle diagram for the Jį Jj JF vertex. Right: The 
analogous diagram for the triple gauge boson vertex. 


j 
Vv P 
k a 
P 


an axial vector (y°) coupling, as can occur for a chiral symmetry, the diagram diverges 
linearly, leading to an anomalous divergence of the currents in perturbation theory that is 
not revealed by formal manipulation of the field equations. If one or more of the currents is 
associated with a global symmetry of the theory, the anomalous divergence does not cause 
any particular problems, and it can even be useful (Adler, 1969; ’t Hooft, 1976a), as we 
will see in Sections 5.2 and 5.8.3. If the currents are all associated with gauge symmetries, 
however, then the diagram contributes to the triple gauge vertex and cannot be regularized 
in a way consistent with gauge invariance, implying that the renormalizability of the theory 
is lost. 
The anomaly coefficient A;;, in the vertex of currents i, j, and k is (Georgi and Glashow, 
1972) 
Aijk = 2Tr L4 {L} , LẸ} = 2Tr Lh{ Lh, LE}, (4.88) 


independent of the fermion masses. We require that each Aij must vanish for a renormal- 
izable gauge theory, both when the three currents are all associated with the same group 
factor and when they are associated with two or three factors in a direct product group. 

Another constraint comes from the (universal) interaction of fermions with gravity (see, 
e.g., Weinberg, 1995), which leads to a breaking of gauge invariance in the presence of a 
gravitational field, proportional to the trace anomaly 


T; = Tr L$ — Tr Lh. (4.89) 


We therefore require for this to vanish as well. 

Ajj, and T; vanish for chiral theories (Li, = L',), and also if both Li, and Li are 
real (equivalent to —L’/'p in a Hermitian basis) since Tr M = Tr MT. The only simple Lie 
algebras in Table 3.3 that admit complex representations are SU(m) for m > 3 (which 
includes SO(6) ~ SU(4)), SO(4m + 2) for m > 2, and Eg, so anomalies can occur only for 
gauge groups that include these factors’ or U(1)’s. 

There are no anomalies for pure QED or QCD, since they are non-chiral. The full SM is 


’The anomalies associated with three SO(4m +2) for m > 2 or three Eg factors vanish even for complex 
representations (Georgi and Glashow, 1972; Okubo, 1977). 
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based on SU(3) x SU(2) x U(1), with SU(2) x U(1) chiral, but as will be seen in Section 8.1 
the anomalies cancel between quarks and leptons or (in non-trivial ways) between L and R. 

We discussed in Section 2.10 that instead of working in terms of the L and R-chiral 
particle fields wy rR, one could just as well express the theory in terms of the L-chiral 
particle and antiparticle fields wz and YẸ, where Wr and Y$, are related by (2.301). The 2n 
fields (Yz ~¢)7 transform under the reducible 2n x 2n-dimensional representation matrices 


i {i 0\_ fit 0 
e(a- i) ao 
where the yf transform as -L since they are basically the adjoints of the Yr. The anomaly 
conditions in this basis are 
Aijk = 2Tr LL") = 0, T; = Tr £’ = 0, (4.91) 


which are equivalent to (4.88) and (4.89). 


4.6 PROBLEMS 


4.1 The spontaneous breaking of a global or local U(1) symmetry allows one-dimensional 
cosmic string classical solutions, analogous to the two-dimensional domain walls considered 
in Section 3.3.2. Consider the complex scalar ¢ in a U(1) gauge theory, with 


L= -IFE + (Du) DY -V(8), VG) = wtb +a (G19), 


where D”ġ = (04 + igA“)d, Fur is the field strength tensor, \ > 0, and p? < 0. It is 
convenient to rewrite 


1 5\2 =e 

V(¢@) = NOLE 5") 3 W= = 

where we have dropped the irrelevant constant —\v*+/4. It was shown (Nielsen and Olesen, 
1973) that there is a classical solution for (x) and A” (x) corresponding to a vortex or 
string running along the z axis from —oo to +00. In cylindrical coordinates (r,0,z) the 


solution exhibits 
—iné 


V 
Erea 
so that V > 0 for r > oo. Single-valuedness requires that n is an integer. Find the asymp- 


totic expression for the vector potential A“ for the string solution, for which F#” — 0, 
D"@ — 0, and calculate the magnetic flux f B - dS = {[(v x A)- dS for the solution. 


4.2 As mentioned in Section 3.2.5 it often occurs that the spontaneous breaking of a 
continuous symmetry leaves a discrete subgroup unbroken. If the original symmetry was 
local, the remaining subgroup is known as a discrete gauge symmetry (e.g., Ibáñez and Ross, 
1992). As a simple example, consider a model involving three complex scalars ¢;, i = 1---3, 
with a U(1) gauge symmetry. Suppose the potential is of the form 


3 
V (41, b2, 63) = X. Vil! pi) + Veubics 
=l 
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where Vilel gi) are quadratic functions of pioi (i.e., quadratic and quartic in ¢; and o!) 
and 
Veubic = o1gi1b2$3 + ong oh + h.c. 


(a) Find the U(1) charges of the ¢; for which the theory is U(1) invariant. 

(b) Show that 01,2 can be taken to be real w.Lo.g. 

(c) Suppose the V; are such that the minimum of V occurs for (0|@3|0) 4 0 but (0/¢1,2|0) = 0. 
Show that a discrete Z3 symmetry remains unbroken. 


4.3 Let t, i =1---m?—1, be m Hermitian scalars transforming according to the adjoint 
representation of SU(m). Define the m x m matrix ® = }>, L’ as in (3.31), where L’ are 
the fundamental representation matrices Lt. Show that the gauge covariant derivative is 


D,® = 0,0 + ig |A,- 1,9]. 


4.4 From (3.19) the defining (vector) representation of SO(m) consists of the m(m — 1) /2 
antisymmetric imaginary m x m matrices. These are conveniently labeled by two indices, 
with g E Ea 

(L°) ap = —1(5055 — 5552), 
where i,j,a, and b all range from 1 to m. Clearly, L’) = — Lt and LË = 0. One could 


restrict the indices so that 7 < j, but it is convenient not to do so, provided one is careful 
about double counting. The trace is 


Tr (LY LM) = 2/55! — 66I"). 
From the vector representation, one has the Lie algebra 
[T T*'] =i (—6F T" _ STIE ae ETI! ae oT) ; 


where the generators T have the same labelling convention as L”’. (It is instructive to 
specialize these relations to m = 3.) 

(a) Calculate the gauge covariant derivative (D#®)a for a scalar or fermion field ®,, a = 
1---m, transforming as a vector under SO(m), labeling the gauge bosons by Aj? = — Aj! 
and the gauge coupling as g. 

(b) Calculate the field strength tensor FJ, for Aj. 


4.5 The gauge transformation for a non-abelian gauge field is given in (4.24). 

(a) Verify that (4.24) reduces to (4.26) for small |6Ż]. 

(b) Verify the transformation (4.22) for a matter field. 

(c) Use (4.24) to prove (4.30) for all B (i.e., not just small |3*|). This implies that Lien, is 
invariant. 

(d) Rederive (4.30) by the simpler method of first proving ig w -L= [D,,, Dy], with D, 
from (4.17), and then showing that UD,U~' = D!, = 0, + igĀ,, vi 


4.6 Derive the gauge vertices in Figure 4.1 for Hermitian scalars. 


4.7 Derive the triple gauge vertex rule shown in Figure 4.1. 
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4.8 Specialize the 3- and 4-point gauge vertices in Figure 4.1 to the case of SU(2). De- 
fine the fields A+ and A®, as in (4.75). Show that the vertices for Ajt(p)A?(q)Az(r), 
Af A) AS A7, and At At AZ AZ are, respectively, 


igCyve(P,G7), tg? Queve, i197 Quvpcs 


and that the others vanish. All of the particles and momenta flow into the vertices. 
Cuvo(p, q,r) is defined in Figure 4.1, while 


Quvpo = 29uvIpo SE GupJva = Juco Gup- 


Note that Q is symmetric in u 4 v or pa, that Quvpo = Qpouv, and that 


Quvpo + Quovp + Qupve = 0. 


4.9 Prove that the gauge boson mass matrix in (4.48) is real, symmetric, has non- 
negative eigenvalues, and that N — M eigenvalues are non-zero. Hint: define an eigen- 
value and normalized eigenvector of M? as \ and w, i.e., M?w = Aw. Use the fact that 
à = wT M?w = wiMju;. Recall that the vectors Liv span an N — M dimensional space. 


4.10 Derive the effective multiscalar interaction in (4.71). 


4.11 Extend the SU(2) model on pages 149-151 (with u? < 0) by the addition of a non- 
yt 
wy? 
(a) Find the interaction vertices for Yt > Y? At, wt > yt A®, and A? = ATA-. 

(b) Write the tree-level amplitude for Yt (p1)Y~ (p2) > At(p3)A™ (p4) in the Re gauge. 
Show that it is independent of €. 

(c) Show that the individual diagrams (for the longitudinal polarizations) grow « s for 
s> M3, m3, but that the leading term cancels in the full amplitude. (The implications for 
unitarity and renormalizability will be discussed in Chapter 7.1.) 


chiral fermion doublet Y = with mass my. 


4.12 (a) Calculate the anomaly coefficient for a left-chiral fundamental representation of 
SU(m) in terms of the d;;, defined in (3.26). 

(b) Consider a field Yap that transforms as the (reducible) direct product of two SU(m) 
fundamentals. It follows from (3.118) that the corresponding representation matrices are 
(Lb) ee = (Li 60a + ala) which can conveniently be written in direct product notation 
as Li, = L'@I+I1@L’', where I is the m x m identity. Calculate the anomaly coefficient for 
a left-chiral field transforming as pap. Hint: Tr A & B = Tr A Tr B, and (A® B)(C 8 D) = 
AC & BC. 
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CHAPTER 5 


The Strong Interactions and 
QCD 
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The modern theory of the strong interactions is quantum chromodynamics (QCD). The 
basic ingredient is that each of the six flavors or types of quark, u,d,s,c,b, and t, has an 
additional quantum number color, which takes the values a = 1,2,3, or red (R), green 
(G), blue (B), and that there is an unbroken non-chiral SU(3) gauge symmetry acting on 
the color index. Thus, there are 8 massless gauge bosons (gluons), G’, and a strong gauge 
coupling gs and strong fine structure constant a, = g2/47. In this chapter we survey the 
properties of the strong interactions and QCD, especially the symmetry aspects at short 
and long distances. For more detailed treatments, see, e.g., (Brock et al., 1995; Pich, 1999; 
Ellis et al., 2003; Sterman, 2004; Kronfeld and Quigg, 2010; Salam, 2010a; Skands, 2013; 
Altarelli, 2013; Trécsdnyi, 2015; Patrignani, 2016). For the historical development, see, e.g., 
(Gross, 2005; Leutwyler, 2014; Fritzsch and Gell-Mann, 2015). 


u 


gs gs gs g? 


Figure 5.1 QCD interactions. 


The quark-quark interaction diagram via one-gluon exchange and the gluon three- and 
four-point self interactions are shown schematically in Figure 5.1. As discussed in Chapter 
2, the dominant higher-order vacuum polarization diagrams in the gluon propagator, shown 
in Figure 5.2, can be absorbed into an effective (logarithmically) running coupling gs(u?) 
that depends on the renormalization scale. Higher-order corrections are minimized if one 
chooses this scale comparable to the momentum Q carried by the gluon. The quark-loop 
contributions screen the color charge at long distance, making the effective force weaker. 
However, the gluon loops anti-screen. The latter dominate for six flavors, so the strong force 
becomes stronger at long distances or low momentum (infrared slavery), and weaker at short 
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distances or high momentum (asymptotic freedom). This is in contrast to QED, which has 
only the charged particle loops, so that it becomes stronger at short distances. 


\ N of 
\ gs(Q) gs(Q) 


k we 7 
Om 


` i: 
\ ab I A 
X 


4 


/ \ 
f ` 


Figure 5.2 The running couplings in QCD. The upper left (quark-loop) diagram 
leads to a decrease in the effective g,(Q?) at small momentum Q (i.e., long distance 
r ~ 1/Q) due to screening of the charge, analogous to QED. The lower left (gluon- 
loop) diagrams have the opposite (anti-screening) behavior. They have no analog in 
QED. Ghost diagrams are not shown. On the right are the effective running QCD 
coupling, and a sketch of the values of as (Q?) = g2(Q?)/4m and the QED coupling 
a(Q?) = e? (Q?) /4r. The QCD coupling becomes small (asymptotically free) for 
|Q| >> 1 GeV, and large (of O(1)) for |Q| < 1 GeV (infrared slavery). 


The Long Distance Regime 


The long distance regime, relevant for momenta |Q| < 1 GeV, is characterized by strong 
coupling, g, = O(1). It is therefore non-perturbative. Isolated quarks and gluons have not 
been observed; presumably they are confined and cannot emerge as free particles due to 
the strong coupling and gluon-self couplings [infrared slavery (Fritzsch et al., 1973)] that 
only allow color-singlet asymptotic states (hadrons). These include gg mesons M, and qqq 
baryons B, in the color-singlet states! 


l 2. _ lig 
IM) ~ 728" laiadis)» |B) ~ g Ndiaye) (5.1) 


+Other possible color-singlets will be mentioned in Section 5.9. 
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In (5.1) a, 6, and y are the color indices, and i,j, and k are collectively the flavor, spin, 
and space indices. Their strong interactions are approximately described by phenomeno- 
logical models, such as the Yukawa model of pion exchange between nucleons. Within the 
underlying QCD theory this is interpreted as an approximation to higher-order effects, as 
illustrated in Figure 5.3. Even though perturbation theory is not useful in this regime, one 
can still utilize the approximate global flavor symmetries described in Chapter 3, i.e., SU (2) 
isospin symmetry relating u and d at the few % level, and the extension to SU(3) relating 
u,d, and s at the 25% level, and their chiral extensions. 


uud ddu 


Figure 5.3 Effective Yukawa interaction, due to quark-antiquark exchange with the 
qq pair interacting via gluon exchange. 


The Short Distance Regime 


The short distance regime, |Q| >> 1 GeV, has weak coupling, gs < 1 (asymptotic freedom). 
It can therefore be described in terms of point-like quarks and gluons with interactions that 
can be calculated in perturbation theory. For example, in a deep inelastic scattering process, 
e7 p — e` X, the final e~ is observed but the final hadronic states, represented by X, are 
not observed and are summed over. (Such processes are known as inclusive, as opposed to 
exclusive ones such as e~p > ep + 3m or e~p — eA? involving a definite final state.) 
For Q? = —q? > 1 GeV? the process is described to first approximation by the photon 
interacting with a point-like quark, as shown in Figure 5.4. Other parts of the diagram, 
involving the distribution of quarks within the proton, and the process of the remaining 
quarks in the proton and the scattered quark turning into hadrons (hadronization), are non- 
perturbative effects. The concept of describing inclusive sums over final states of hadrons 
in terms of perturbative calculations involving quarks and gluons is known as quark-hadron 
duality (e.g., Melnitchouk et al., 2005). 


5.1 THE QCD LAGRANGIAN 


The quark fields are denoted qra, where a = 1,2,3 or R,G,B is the gauged color and r = 
u, d, 8, c, b, t is the (ungauged) flavor index. An alternate notation is ua = qua, etc. The Dirac 
indices are suppressed. The quarks transform under the fundamental (3) representation of 
SU(3), and are non-chiral, Li, = Li, = >. The Dirac adjoint field transforms as a 3* and 
is written q2. There are 8 Hermitian gauge fields (gluons), G’ = GĦ, i = 1---8. The QCD 
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Figure 5.4 Deep inelastic scattering. The interaction between the lepton e~ and the 
quark can be described perturbatively by one photon exchange. However, the distri- 
bution of quarks in the proton, and the hadronization of the remaining and scattered 
quarks (the blobs) into unobserved hadrons X = X, + Xə are non-perturbative. 


Lagrangian density is 


1 i iv -Q5 -=Q 0 CD i Miuv 
LQocD = -gem C" +Y iPage- Y mi dra + Dana , (5.2) 


where the field strength tensor is 
Gi, = pG — OG), — gs fijk GI G5. (5.3) 
The quark gauge covariant derivative is 


DHP = (DY) „g = 88E + r (5.4) 


7 


where P . 
Gê = (ep =Y G2, (5.5) 


with GS = 0, represents the gluon field in tensor or matrix notation. The m,. in (5.2) are 
the current quark masses. They are actually generated by spontaneous symmetry breaking 
in the full standard model (including the chiral electroweak part), but can be considered as 
bare masses when considering QCD alone. Without loss of generality (for QCD) they can 
be taken to be real, nonnegative, and diagonal in flavor. 

The interaction terms in (5.3) and (5.4) are the same for all six flavors, so they are 
invariant under a global chiral U(6) x U(6) flavor symmetry. However, Mme, Mb, and me are 
large compared to the typical scale of the strong interactions, A ~ (200 — 300) MeV, so 
the symmetry is badly broken except at very high energy. More useful is an approximate 
SU(3) flavor symmetry in the limit Muy ~ Mma ~ msg, which is valid at the 25% level, and 
the even better (1%) SU(2) isospin symmetry, which holds in the limit Mma ~ my.” These 
symmetries are enhanced to become chiral for m = 0; e.g., for Mu = Ma = 0 the continuous 
symmetries of Lacp become 


SU(3)color X SU(2) x SU(2) x U(1) x U(1), (5.6) 
SaaS Ss Oe Seo 
gauge global global B,Ba 


In fact, mu, mg, and mg are not really degenerate compared to each other. However, my, and mq are 
both very small compared to A, so they are “degenerate” in the sense that mu/A ~ ma/A ~ 0. Similarly, 
SU(3) holds approximately because ms ~ 100 MeV (> mya) is smaller than A though non-negligible. 
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while the SU(2) x SU(2) becomes SU(3) x SU(3) ifm, > 0 as well. The U(1) factors are 
separate L and R chiral baryon numbers. The sum (difference) of the generators corresponds 
to baryon (“axial baryon”) number B (B4). However, B4 is not a good symmetry of the 
strong interactions, and its unsuccessful prediction in the quark model is known as the azial 
U(1)a problem. Its resolution by non-perturbative effects in QCD will be commented on in 
Section 5.8.3. 

Another difficulty is the strong CP problem, which refers to the last term in (5.2). The 
strong interactions are observed to be reflection invariant (i.e., parity P is conserved), as 
well as invariant under charge conjugation (C), time reversal (T), and the products CP 
and CPT. The first three terms in (5.2) respect these symmetries. However, it is possible 
to add the final strong CP term to Lacp, where $gcp is a dimensionless constant, and 


ee a (5.7) 


is the dual field strength tensor. (G and G are related by exchanging the analogs of the 
electric and magnetic fields.) The strong CP term is gauge invariant and does not spoil the 
renormalizability of QCD. However, for 6gcp # 0 it violates P, T, and CP symmetries, 
and stringent experimental limits on the electric dipole moment of the neutron require 
lQgcp| < 107" — 107" (e.g., Kim and Carosi, 2010). For pure QCD it is possible to simply 
impose these symmetries, i.e., to take 9g¢p = 0. However, as will be discussed in Chapter 
10, this becomes problematic in the context of the full standard model, which has other 
sources of CP violation. 


5.2 — EVIDENCE FOR QCD 


QCD is the unique renormalizable field theory consistent with the observations that ex- 
isted by ca. 1970, and since that time there has not been any serious competing theory.’ 
Nevertheless, it is interesting to review some of the evidence for the ingredients of QCD. 


Spin- Quarks and Spin-1 Gluons 


The first evidence for spin-4 quarks was the success of the constituent quark model, which 
successfully classified a large number of hadrons in terms of three flavors of quarks. The 
spin-5 nature is essential for this classification, as is evident from the construction of the 
nucleons out of three quarks or of the spin-1 p from a qq pair in an S-wave. The very 
simple description of the approximate flavor symmetries of the strong interactions and their 
breaking in the quark model, i.e., (3.113) on page 110, also provided evidence. 

By 1970, dynamical evidence emerged from the deep-inelastic scattering process e~ p > 
eX, with Q? = —q? > 1 GeV’, shown in Figure 5.4 and to be described in Section 5.5. 
The rate observed at the Stanford Linear Accelerator Center (SLAC) at large Q? was much 
larger than would be expected for a “big fuzzy” proton, calling for the existence of point- 
like parton constituents of the proton, reminiscent of the discovery of the atomic nucleus 
in the Rutherford experiment. In principle, the partons could be spin-0, spin-4, or higher. 
However, the angular distribution of the scattered electron established that they are spin- 
1 


5 (the Callan-Gross relation), consistent with being quarks. This was later confirmed in 


3In fact, prior to the development of QCD many physicists seriously considered abandoning the ideas of 
field theory or of any fundamental dynamical equations for the strong interactions, in favor of the bootstrap, 
which postulated that there was a unique S-matrix consistent with the ideas of unitarity, analyticity, 
crossing, etc. (See, e.g., Eden et al., 1966; Collins, 1977). 
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the distributions observed in deep inelastic w* N and PN scattering and in their relative 
strengths. 

Additional evidence emerged a few years later in the study of ete~ — hadrons. The 
observed cross section falls as 1/s, where s is the square of the CM energy. This is consistent 
with the behavior expected for the production of point-like quarks (up to higher-order QCD 
corrections), as in (2.232) on page 46, and not with the more rapid falloff expected without 
such constituents. These later hadronize into collimated clusters of hadrons known as jets. 
The spin- nature was established by the observed 1+ cos? @ angular distribution of (2.231) 
for the jets, as opposed to the sin? @ distribution predicted for spin-0 (Problem 2.23). On 
the other hand, the positive evidence for quarks was apparently contradicted by the non- 
observation of isolated quarks. The resolution of that conflict had to await the development 
of QCD and the notion of infrared slavery. 

The first direct evidence for spin-1 gluons came from the observation of distinct 3-jet 
events from ete~ — qgG and of the planar broadening of events in which the third jet could 
not be resolved, by the TASSO and other collaborations at PETRA (DESY) in 1979 (see, 
e.g., Wu, 1984; Bethke, 2007). Another type of compelling evidence is indirect, i.e., the 
observed asymptotic freedom of the strong interactions requires a non-abelian gauge theory. 
Other advantages of color octet gluons were especially emphasized in (Fritzsch et al., 1973). 


Evidence for Color 


The observed hadrons are all color singlets. Nevertheless, with the benefit of hindsight, the 
color quantum number was already needed in the original quark model. That is because 
the quark assignments for the baryons and hyperons (baryons involving an s quark) were 
totally symmetric in the flavor, spin, and space indices. In particular, the Q7, which was 
successfully predicted by the SU(3) model (Section 3.2.3), was interpreted as 


IQT) = |stststy, (5.8) 


where the arrows all represent spin-up with respect to a reference axis. The 27 is therefore 
symmetric in flavor (all s quarks), spin (all spins in the same direction), and in space 
indices (the orbital angular momenta are zero). However, this violates the spin and statistics 
theorem, which follows from the union of relativity and quantum mechanics (see, e.g., 
Streater and Wightman, 2000), and which requires that all physical spin-4 states should be 
antisymmetric. This fundamental difficulty with the quark model is easily resolved by the 
introduction of the color quantum number, under the assumption that the Q7 and other 
baryon/hyperon states are color singlets, since the projection of 3 x 3 x 3 onto the singlet 
is totally antisymmetric, as in (5.1), 


IQT) x PTs] shal). (5.9) 


Thus, the color quantum number was needed even before the development of QCD,*, where 
it played the additional role of a gauge quantum number. 

There are other tests based on counting the number of colors that contribute to an 
amplitude or rate. The leading diagrams for ebe~ — hadrons are shown in Figure 5.5. At 
short enough distance, one may regard the process as first producing quark qra and its 
antiquark, which may be computed perturbatively, followed by hadronization, in which the 


4An apparently alternative resolution of the statistics problem, parastatistics (Greenberg, 2008), is in 
fact equivalent to the existence of the color quantum number. 
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Figure 5.5 ete — droge. Left: the blobs represent the quark hadronization. Right: 
higher-order QCD corrections. 


quarks turn into jets of hadrons such as pions (low momentum [soft] gluons or quarks may 
be exchanged between the jets to ensure color neutrality). There are also higher-order QCD 
corrections, which can be calculated perturbatively. One usually expresses the theoretical 
and experimental result in terms of the ratio 


a(ete~ — hadrons) 
a(ete” > utu) 


R(s) = (5.10) 
at CM energy ys, where the denominator is the lowest order theoretical expression in (2.232) 
with a running coupling, o(ete~ — wtp) = 4ra?(s)/3s. R(s) is convenient because the 
largest energy dependence cancels in the ratio, and experimentally because the luminosity 
also cancels. R(s) counts the number of quark colors and flavors, weighted by the quark 
electric charge-squared e?, where e, = 3 for [u, c, t] and —4 for [d, s, b]. The lowest-order 
prediction (i.e., ignoring a.) is R = Ne}, e2, where Ne is the number of colors and only 
the nq quarks lighter than \/s/2 should be included in the sum. For QCD (N. = 3) this is 


5 6 10 11 
R= 3 for [u,d], 3 for [uds], 3 for [udsc] , z for [udscb] . (5.11) 
The higher-order QCD corrections have been computed to 4 loops (Baikov et al., 2012). 
Neglecting quark masses,” one predicts 


_ 2 Os - ng (As\? | na (%s\? | na (O8\* 
nan Da lis ee (Sy rae) aa S] eam 


for ng quark flavors. The higher-order terms use the value of the running a at s. For ng = 5, 
for example, c3} = 1.40902, c3 = —12.80, and cł = —80.434. References to quark mass and 
Z exchange corrections are given in (Patrignani, 2016). The QCD (N. = 3) prediction 
is in excellent agreement with the experimental result in Figure 5.6. R also excludes an 
alternative quark model involving integer electric charges (Nambu and Han, 1974), at least 


5Similar to QED, there are infrared singularities associated with both virtual and real gluons as their 
energy approaches zero, as well as mass (or collinear) singularities that occur in the limit of a massless 
quark due to gluons radiated parallel to the quark. These can be shown to cancel under realistic conditions 
using dimensional regularization or by introducing a fictitious gluon mass. (The latter is only possible for 
diagrams not involving the non-abelian gauge vertices, since it would violate gauge invariance). See (Field, 
1989; Salam, 2010a) for detailed discussions. 
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Figure 5.6 Experimental data on R(s) compared with the lowest order (dashes) and 
three-loop (solid) QCD predictions. There are steps at the s, c, and b thresholds, 
given approximately by the locations of the ¢ (s5), J/w (cé), and YT (bb) resonances. 
The perturbative prediction works extremely well above a few GeV, provided one 
includes the threshold resonances. At high energies, Z boson exchange strongly 
dominates over one photon exchange. Plot courtesy of the Particle Data Group (Pa- 
trignani, 2016). 


under the assumption that the quarks can be treated as pointlike (Problem 5.1). Further 
tests of QCD in ete~ annihilation are reviewed in (Kluth, 2006). 

Other evidence for color and QCD includes the ratio of nonleptonic and leptonic decays 
of the W, since W7 — qq, where qq = du, sc counts the number of colors, while the leptonic 
decays into (~ i, £ = e, u, T, do not. In particular, the branching ratio 


T(W- > er) 2 1 
T(W- > qa) +r (W- 367)” 342N, Ne= 


B(W- sev )= 11% (5.13) 
(up to small corrections from QCD, fermion mass, quark mixing, etc.), in agreement with 
the experimental value ~ 10.7%. 

Another probe is the Drell- Yan process in which 


pp > ¢té- + hadrons, & = e7, a (5.14) 


at large (pe+ 4p) > 1 GeV’. This is dominated by the qq annihilation through a y 
or Z, as shown in Figure 5.7, and can be thought of as the inverse to ete” — qq. The 
dra and qf can each be in one of three color states, but only the combination in which 
a = 2 can contribute, leading to a cross section 1/N. compared to what would be expected 
without color, in agreement with observation. To see this, consider pp scattering in the 
approximation of considering the valence quarks only. Using the baryon wave function in 


(5.1), 
Ly? iy 1 
Ocolor = 3x2x2x (=) x (z) Ono-color = 3 7no-color» (5.15) 
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Figure 5.7 Left: Drell-Yan process, pp > y, Z > 47 +. Right: The 7? — 2y decay. 


where the 3 represents the three colors that can annihilate, and 2? represents the color 
assignments of the remaining quarks, all of which add incoherently.® 

One can use SU(2) x SU(2) chiral symmetry to show that the chiral anomaly associated 
with the triangle diagram in Figure 5.7 (with 7° coupling to the axial isospin generator 
T} — T?) dominates the 7? + 2y decay amplitude in the my — 0 limit (Adler, 1969; 
Donoghue et al., 2014). This again counts the number of quark colors, so that 


Ne 2 am? Ne 
T (7? > 2y) ~ (=) 3003 P = 7.75(2) (=) eV, (5.16) 


where f = 130.5(1) MeV is the pion decay constant, which is associated with the sponta- 
neous breaking of the chiral symmetry and is measured in t+ — u*v decay. The prediction 
for Ne = 3 is increased to ~ 8.10 eV by chiral-breaking corrections (Bernstein and Holstein, 
2013), consistent with the experimental value, 7.6(3) eV. 


5.3 SIMPLE QCD PROCESSES 


In this section we sketch the derivation of some simple QCD processes at tree level. Some 
of the calculations are similar to the QED calculations in Chapter 2, except that one has 
to properly take the color factors into account. Others involve the gluon self-interactions, 
which have no QED analog. Processes involving non-abelian vertices are extremely tedious 
to carry out by hand, and are best handled by specialized computer algebra programs (see 
the list of websites in the bibliography and the example notebooks on the book website). 
However, we will illustrate one relatively simple example. Higher-order calculations involve 
all of the subtleties of gauges and ghost loops (the latter may even appear in tree-level 
calculations involving external gluons), which are treated in standard field theory texts. 


Color Identities 


The calculations are greatly simplified by the use of certain color identities listed in Table 
5.1 for the fundamental representation matrices L$ = \'/2 (denoted in this section by L’) 
and the SU (3) structure constants f;;, defined in Tables 3.1 and 3.2. They follow easily from 
or are special cases of the identities given in Sections 3.1.2 and 3.1.3 and in the Problems 
in Chapter 3. 


6 An equivalent derivation is to simply average over the Ne colors of the interacting q and g, so that 


o œ (1/Ne)? ae"? = 1/Ne. 
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TABLE 5.1 SU(3) color identities? for L’ = X'/2 and fijx- 


LiL = or Tr (LILI) = 564 

Jijk fijm = 3km Jijk fijk = 24 

fiim frim fijn frin = 72 fijm frim fikn fjin = 36 
Tr (LLI L*) = 3 (dijk + i fijn) 

Tr (L'LIL*) ifijm = —35*™ Tr (LILI LE) ifijk = —6 
Tr (L'LIL'L*) = — b ôjk Tr (LİLİ LLI) = —2 

Tr (L'II LI LÀ) = 18 Tr (LLI) Tr (LILI) =2 


Jilimdjmk = Jimkdjim T. Jijmdmik =0 
fimfjmk — fimk fjim + fijmfmik =0 (Jacobi identity) 


oip = X; LÌ LŻ, I is the 3 x 3 identity matrix, and the indices run from 1 to 8. There is no distinction 
between upper and lower indices. 


qq and qq Scattering 


qq and qq scattering via gluon exchange are very much like the simple QED processes 
described in Section 2.8. Here we will neglect the quark masses for simplicity, but it is 
straightforward to include them (as would be necessary for the production of a heavy quark, 
such as uti — tt). The process grgGra > YsyGs5, where a, 3,7, and 6 are color indices and 
r Æ s are flavor indices, proceeds through an s channel gluon, as shown in the first diagram 
in Figure 5.8. The only differences compared with the QED process e~e+ —> f f in (2.225) 
on page 46 and Figure 2.15 are that —e?Q + — g? and that there are color factors on the 
vertices and gluon propagator, as shown in Figure 4.1, yielding 


=j gP 5k 
S 


Mpi (—igs üz Yu DEva) ( (—igs V2IpLigtr) 
ae (5.17) 
l i ie S _ 
= Lig Lis (tizYpv4 Boy"u1) - 
The calculation of the spin-average cross section proceeds as in eet — ff, Equa- 
tion (2.227), except it is now convenient to do a color average as well, in which one averages 
(sums) over initial (final) quark colors a = 1---3. Similarly, in processes involving external 


gluons one averages (sums) over initial (final) gluon color indices i = 1---8. Neglecting the 


alps) ~ LEs as (pa) qy (ps) , qs (pa) 
6k PEN Li Q0000 L* 
- otk 
Yd AL 4 
s L ~ =~ 


Figure 5.8 Diagrams for ¢;8Gra —> dsy%ss, where r and s are flavor indices. Only the 
diagram on the left contributes for r Æ s. 
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quark masses, the expression 2Q}e*(¢? + u?)/s?, derivable from (2.227), is replaced by 


2 3 2 2 

- 1 ior poe Gena lt tru 
MP = (5) D Hetiete (A) 
œ, ß,y,ð=1 (5.18) 


1 Si go P + u? 4 +u? 
= gTr (LL) Tr (E'L) (248) ( z ) = gat ( 2 i; 


For annihilation into the same flavor, ¢-gGra —> Gr+Grs, the calculation is similar to Bhabha 
scattering in (2.233)—(2.235). The first two terms are obtained by the replacement e+ > 
29/9, just as in (5.18). However, the third (interference) term now has the color factor 


2 


1 i i jx rj* 1 irjrirj 
goals =, CATA) = =r (5.19) 
so that F : a P 
7 4 alt 2 
iPS ag | ee (5.20) 


5 s2 t2 3st) 
The spin and color-averaged squared matrix elements for a number of 2 > 2 QCD 


processes, neglecting masses, are listed in Table 5.2. More extensive listings, including mass 
effects and extensions to supersymmetry, may be found in (Patrignani, 2016). 


TABLE 5.2 Spin and color-averaged squared amplitudes |M|?/g? for various QCD 
subprocesses,” characterized by kinematic subprocess invariants s, t, and u. 


|M|?/95 90° 
ards — OrQs (<3) 2.2 
GrIs > Urs ie 
21,2 
GrGr > qsqs t tH ) 0.2 
21,2 21,2 2 
qd qq oe a 2.6 
21,2 24 42 2 2 
qq qq eo eye ae ae 


qG > qG 
GG > GG 
4q — GoGo 
GG > qoq 


tu 0.1 
0.07 


4 
9 
4 
9 
5 
4 
9 
(nce ) pce) (e o) (2a) 
(: , 
9 
4 
9 


wn 
NI 


Ble 
— 
g 

wi 
= 
wN 
N 

J 
g~ 


ar Æ s when flavor indices are given. The last two processes involve the production of a hypothetical 
spin-0 color-triplet qo, such as one encounters in supersymmetry. Masses are neglected. The last column 
is the numerical value for CM scattering angle 90°, where t = u = —s/2 (neglecting masses). Expanded 
from (Combridge et al., 1977; Barger and Phillips, 1997). 


GG > qoo 


To illustrate a non-trivial non-abelian vertex, consider the process GG — qoo, where qo is 
a hypothetical spin-0 color triplet, such as a scalar quark in supersymmetry. There are four 
tree-level diagrams, as shown in Figure 5.9 Using the vertex factors from Figure 4.1, the 
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P35 Q Pa Ê P3, Q Pa; B 
3 1 Z 
3k, o AA 
9 eo, F %& 
Pı; îi, H P2, J, V Pisi, H P2, j, V 
P3, Q Pa, B pa, B. P3, Q 
X J ~ í 
Li Natalee L Liy--=--%Li 
SS p3 — pit, a Pi — pst, 
GA K2 F K2 
P1, 2, H P2; J; V P1; 2, H P2, J, V 


Figure 5.9 Diagrams for G% (p1)G} (p2) > qoa(p3)ĝos (p4), where qo is a hypothetical 
spin-0 color triplet. 


amplitude in Feynman gauge is 
—i 
=| Js fijk 

x [g (p2 — pr)” + g7 (2pı + pa)” — 9”? (pı + 2p2)*] 
+igsg’’ {L', L} 


M = EluE2v —igs LE (p3 = Dado ( 


m (5.21) 


+ (-igs)? (LLI) g (=) (2p3 — p1)” (p3 — pı — pa)” 


+(—igs)” (LL) ag (=) (pı — 2pa)"(p1 + p3 — pa)” | , 
where mo is the go mass, and €1,, = €p (P1, A1) and €2, = €x (P2, Az) are the gluon polarization 
vectors. The straightforward way to calculate |/|? would be to first take the absolute square 
and then use (2.121) on page 28 for the gluon polarization sums. However, this would be 
extremely tedious. The calculation would be simplified if the second term in (2.121) did 
not contribute, but this requires the calculation to be done in a gauge invariant way (cf., 
the discussion of Compton scattering in Section 2.8). In particular, for a non-abelian theory 
one must include the negative contribution of fictitious ghost pairs (Sterman, 1993, Section 
8.5; Peskin and Schroeder, 1995, Section 17.4). Although this is straightforward, it can be 
avoided by introducing explicit expressions for the polarization vectors, just as we did in 
(2.181). 

For the qoQo final state it is simpler to calculate the amplitudes rather than their absolute 
squares, analogous to the fermion helicity calculations in Section 2.9. The four-momenta in 
the CM frame are 


Piz = k(1,0,0, 1), P34 = Es (1, £8; sin 0,0, +8r cos 0), (5.22) 


where k = \/s/2, kp = ys — 4mé/2 = Bs Ey = Brk, and 6 is the CM scattering angle. The 
gluon polarization vectors are 


€1,2(1) = (0, +1, 0, 0), €1,2(2) = (0, 0, 1, 0) (5.23) 
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in a linear basis, where en (An) = €n(Pn, An), = 1,2. (The sign for €2(1) was chosen to be 
consistent with the space reflection convention in (2.273) on page 53.) Thus, 


€1(A1) « €g(Ag) = (—1)1 416412 
P1 €n(A) = pa: en(A) = 0 
p3°€1(1) = —p4 : €& (1) = -ps - €2(1) = pa €2(1) = —ky sin 0 
P3 ` €n(2) = pa: €n(2) = 0, 


(5.24) 


which greatly simplify the calculation. Denoting the amplitude by M(Aj,A2), one has 
M (1,2) = M(2,1) = 0, and only the g” terms contribute to M (2,2), 


M (2,2) = —ig; -ifatt (=) T TAIM . (5.25) 
t and u are related to 0 by 
t — må = —2k° (1 — Bs cos 0), u — mg = —2k° (1 + By cos), (5.26) 
so that e n 
( r ) = =n cos 0. (5.27) 


M(1,1) has the same g”” terms as M (2,2), as well as contributions from the t and u-channel 
pole terms. These are easily calculated, yielding 


1+ 


M(1,1) = M(2,2) 


2kFs sin? 0 
(t — mõ) (u — ms) 


= M(2,2) h | amè ( 1 a 1 )| (5.28) 


2 
t— mô u— m 


= M(2,2)[1+ X], 


where (5.26) was used to obtain the second form. The spin and color-average amplitude 


squared is 
= 1 1 
IMP =F ag AL Ma, (5.29) 
ij œp A1,A2 

where the factors of 1/4 and 1/64 are, respectively, due to the averages over the gluon spins 
and colors. There is no interference between the two terms in M(2,2) because they are, 
respectively, antisymmetric and symmetric in i and j. From Table 5.1, the color sums for 
the squares of these terms are 12 and 28/3, respectively, so that 


IMI? = 9s i Ar E ES ae (5.30) 
64 s 3 
and da _ 1 ke ge do_ 1 Wik (5.31) 
dcos@ 327s k , dt 167s? ` i 


One can obtain the amplitudes for left and right circularly polarized gluons using (2.118), 
kes sin? 0 
) 


M(L, L) = M(R, R) = 


NI = 


[M(1, 1) — M(2,2)] = -M (2,2) i (t= mB) (u— mg 


(5.32) 


M(L, R) = M(R,L) = 


NI re 


kessin” 0 
PR a | ea) 
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M(L, R) and M(R, L) vanish in the forward and backward directions because of angular 
momentum conservation, i.e., the gluon spins are in the same direction and cannot be 
compensated by orbital angular momentum.’ M(L,L) = M(R, R) and M(L, R) = M(R, L) 
follow from reflection invariance. 


5.4 THE RUNNING COUPLING IN NON-ABELIAN THEORIES 


It was already described in the introduction and (for QED) in Section 2.12.2 that many 
of the higher-order corrections to QCD (and other field theories) can be absorbed into an 
effective gs(u?) or as(u?) = gs(u?)?/4r, where p is the renormalization scale. Higher-order 
corrections are usually minimized when yz is taken to be a typical momentum scale of the 
process, such as u? = Q? = |q|?, where q might be the four-momentum carried by an 
exchanged gluon. The gluon self-interactions in QCD imply asymptotic freedom, i.e., that 
as(u?) becomes small at large u >> O(1 GeV) (short distance), so that one can treat the 
quarks and gluons as weakly coupled, and processes such as deep inelastic scattering can 
be calculated in perturbation theory. For small u (large distance), a,(u?) becomes large 
and perturbation theory no longer holds. The strong coupling and gluon self-interactions 
presumably lead to the confinement of quarks, gluons, and any colored states, so that only 
color singlet hadron states can emerge. In the real world, both regimes may be relevant to 
different aspects of a process, e.g., a quark may scatter in the short distance regime, but 
its initial distribution in the proton and its subsequent hadronization are low momentum 
processes. Fortunately, for many processes the short and long distance effects can be factor- 
ized, with the former calculable. The latter must be taken from experiment, although their 
logarithmnic Q? dependence is predicted by QCD. 

The deep inelastic scattering experiments at SLAC could be understood in the simple 
parton model of point-like constituents of the nucleon, which had been developed somewhat 
earlier to understand hadronic processes (see, e.g., Field and Feynman, 1977). However, it 
was quickly understood that the interactions of the partons had to be asymptotically free, 
so that they could appear point-like at short distances and still be confined in the nucleon. 
The breakthrough came in 1973, when it was shown that non-abelian gauge theories could 
be asymptotically free (if there are not too many matter fields), and furthermore that they 
are the unique asymptotically free renormalizable theories in four dimensions (Gross and 
Wilczek, 1973; Politzer, 1973). Combined with the evidence for three colors, QCD emerged 
as the unique candidate theory. 


5.4.1 The RGE Equations for an Arbitrary Gauge Theory 


The running of the effective gauge coupling g in a gauge theory with a single group factor 
is described by the renormalization group equation (RGE) 


dg? _ DNE ee 6 


1 loop 2 loop 
where the coefficient of the one-loop term in the 8 function is 


1 11 4 1 1 
(4r)? | 3 


Trl» (5.34) 


TThe sin? 6 factor is cancelled by the singularities from the t- and u-channel poles for me — 0, as can 
be seen from (5.28) and (5.32). 
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where C2(G) is the quadratic Casimir defined after (3.22), with C(SU(m)) = m and 
C2(U(1)) = 0. Tr, Ty,, and Tọ, are, respectively, the fermion, complex scalar, and Hermitian 
scalar Dynkin indices 


1 ee F ind 
Tebiy = 51 (LL14) + 5% (Lath) Soo Tr (1i) 


Tobg=Te (TEA),  Togdg = Tr (14,14, ) 


T(Lm) = 4 in SU(m) for a fundamental representation; for U(1) a set of fields with U(1) 
charges qa yields T = DO, g2 (or $30, (a2, + Gp) for chiral fermions). One must sum 
over all of the IRREPs in which the particle masses are smaller than Q, so that there 
are discontinuities in the slope at the particle thresholds. Equation (5.33) is easily solved 
analytically in one-loop approximation, 


(5.35) 


1 1 E n a0? 
a(@2)  a(M2) 4rbl We + 0 [a(Q’)], (5.36) 


where a = g?/4m and M is an arbitrary reference scale. Thus, 1/a(Q?) varies linearly 
with In Q?. Asymptotic freedom? occurs for b < 0. Since C2(G) and the Dynkin indices are 
nonnegative, asymptotic freedom always occurs in a pure non-abelian gauge theory, but not 
in U(1). In the presence of fermion and scalar fields, asymptotic freedom may or may not 
hold depending on their contribution relative to the gauge terms. 

The two-loop contributions to 6 are also known (see, e.g., Martin and Vaughn, 1994; 
Luo et al., 2003). [They are known to four loops for pure QCD (van Ritbergen et al., 1997).] 
However, at this order one must also include diagrams involving other interactions, such as 
Yukawa interactions or other gauge factors, which lead to a coupling between their RGEs. 
More careful treatment of thresholds and of the renormalization scheme are also needed at 
this order. 

For QCD, one has C2 = 3, Tp = “, and Tọ. n = 0, where ng is the number of quark 
flavors lighter than Q, e.g., ng = 3 for ms < Q < Me (perturbative results are not expected 
to be valid below ms), ng = 4 for me < Q < mp, and ng = 5 for mp < Q < my. Thus, at 
one loop b = —(33 — 2n,)/487? and 


2 2 
sO 7 aay tae = ago te Oe 
This can be rewritten 1 
as(Q”) = RA (5.38) 
where the scale? A is defined by 
A= M exp (aaam) (5.39) 
2b>°as (M?) 


(see Problem 5.7 for a generalization to two loops). In addition to as (Q?) becoming small 
for large Q, it also goes to œ at the scale Agcp = A, at least in one-loop approximation. 


8Other renormalizable interactions in four dimensions also satisfy RGE for their running couplings, but 
these are never asymptotically free (Coleman and Gross, 1973; Zee, 1973). 

One can define A(”4) as the value relevant to the region with nq light flavors, with the discontinuities 
at the thresholds fixed so that a; is continuous. 
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Of course, the one-loop approximation and perturbation theory break down in this limit, 
but one may nevertheless loosely interpret A as the scale at which a, becomes large. This 
treatment is easily extended to higher orders, inclusion of quark thresholds, etc. a; has been 
determined in many ways at different scales (d’Enterria and Skands, 2015; Deur et al., 2016; 
and the QCD review in Patrignani, 2016), often from the corrections to simple parton model 
results. These include hadronic t decays, T spectroscopy and decays, e+e~ annihilation 
event shapes above and below the Z, deep inelastic scattering, jet and tt cross sections at 
the LHC, and the width for Z — hadrons. Other determinations are made by comparing 
perturbative calculations of quantities such as current correlation functions or the static 
energy between color sources at close distance with the corresponding lattice evaluations. 
The running predicted by QCD is clearly confirmed, as illustrated in Figure 5.10. It is 


April 2016 
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4 DIS jets (NLO) 
o Heavy Quarkonia (NLO) 

ete jets & shapes (res. NNLO) 
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Figure5.10 Running of the QCD coupling as a function of the scale u ~ Q. The data 
points are various experimental determinations and the band is the best fit QCD 
prediction. Plot courtesy of the Particle Data Group (Patrignani, 2016). 


convenient to quote the value of a, at the mass of the Z because that is the scale at which 
the electroweak couplings are best determined. There is a direct measurement of as (M2) 
from the hadronic Z width, and measurements at other scales can be extrapolated to Mz 
using the higher-order QCD running and the value of A obtained in a global fit. The QCD 
review in (Patrignani, 2016) gives the precise average a, = 0.1181+0.0011 from an analysis 
based on the quantities with small and manageable theoretical uncertainties. 

The observed running corresponds to A ~ (100 — 400) MeV, depending on the exact 
definition. This sets the scale at which the strong interactions become strong and determines 
the approximate scale of such strong interaction quantities as the nucleon and p masses. In 
fact, the physical hadrons not involving the c or b quark receive relatively little contribution 
from the bare quark masses (this is especially true of the non-strange ones), and have masses 
dominated by the dynamical or constitutent quark mass Mayn ~ A, which (along with the 
pion decay constant f,(A)) is associated with the spontaneous breaking of the SU(2) x SU(2) 
or SU(3) x SU(3) chiral symmetry. The one exception is the pseudoscalar octet, which are 
pseudo-Goldstone bosons with masses generated by the explicit chiral breaking from the 
bare quark masses (see Sections 3.3.3 and 5.8). 
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If one ignores the bare masses, there are no dimensionful parameters in the QCD La- 
grangian. However, A emerges by dimensional transmutation, i.e., it is the scale at which 
the dimensionless coupling becomes large. This implies that only dimensionless ratios like 
m,/A or m,/A are meaningful, at least in the m, = 0 limit. One could choose to simply 
define units so that A = 1, analogous to c = h = 1. However, it is more convenient to 
keep the “historical units” for masses, or to scale everything in terms of easily measured 
quantities such as mp. Of course, when one considers other interactions, new scales, such 
as the Z mass of the Planck scale, also become relevant. 


5.5 DEEP INELASTIC SCATTERING 


Let us consider deep inelastic lepton nucleon scattering (DIS) in more detail. In this chap- 
ter we focus on e-p — e` X, as shown in Figure 5.11, where the momentum transfer 
Q? = —q? > 1 GeV’. This limit is in the short distance regime, so to first approximation 
one can describe the underlying process as the scattering of the virtual photon with a free 
quark, as in Figure 5.4 (the simple parton model or SPM). Only the final electron is ob- 
served in the scattering, and the unobserved hadrons X are summed over. These typically 
involve complicated multihadron states. Other deep inelastic reactions substitute et, p*, 


and Vp for the e~ , with neutrino scattering proceeding by either charged or neutral current 
processes through the exchange of W or Z. [The HERA e*p scatterings involved all of 
these processes (Diaconu et al., 2010; Abramowicz et al., 2015).] The initial proton may be 
replaced by a nuclear target, which either gives a weighted sum of p and n cross sections, or, 
for bubble chamber experiments, sometimes allows one to identify whether the scattering 
was from a p or n. We will concentrate on unpolarized scattering, but some experiments 
have involved polarized ¢* and/or hadrons. 

It is difficult to overestimate the importance of deep inelastic scattering. DIS via y or 
W= exchange was especially important for establishing the existence of point-like quarks, 


while the test of the higher-order corrections to the SPM helped establish QCD. Similarly, 
( 


the neutral current processes probed in DN > D, X were extremely important early tests 
of the standard electroweak theory. More detailed descriptions may be found in (Renton, 
1990; Barger and Phillips, 1997; Ellis et al., 2003; De Roeck and Thorne, 2011; Perez and 
Rizvi, 2013; Blumlein, 2013; Patrignani, 2016). 


5.5.1 Deep Inelastic Kinematics 


The kinematics of DIS are indicated in Figure 5.11. Since the final hadrons are not directly 
observed, the independent variables for the unpolarized case are the four-momenta p, k, 
and k’ of the proton and of the initial and final electron. Except for the HERA e®p collider, 
all of the experiments have been performed in the proton or nuclear rest frame (the lab 
frame), where the observables are the energies Ep x’ of the initial and final electrons, and 
the electron scattering angle 0. 

There are a number of useful related kinematic variables, which we express both in 
Lorentz invariant form and in terms of the lab frame observables. The total CM energy 
squared is 


s= (k +p}? = m2 + M? + 2EM ~ M? +2kM, (5.40) 


where M is the nucleon mass, and k ~ Ex, is the magnitude of the e~ three-momentum. 
In the remainder, we neglect the electron mass me. The momentum transfer-square and 
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Figure 5.11 Left: deep inelastic scattering e~p — e` X, where X represents unob- 
served hadrons. q = k — k’ is the four-momentum of the virtual photon. Right: 
kinematics in the proton rest frame. Ep and Ex are the energies of the initial and 
final electrons, and 0 is the laboratory scattering angle. 


energy transfer to the hadrons are 


Q@a=-¢ = -k-k z 2kk'(1 — cos 0) 


; 5.41) 
pd , ( 
= Eg — Ep wk-k. 
7 M ab k k 
The invariant mass-square of the unobserved final hadrons is 
W? = P$ = (p +q}? = M? + 2Mv - Q? 
(5.42) 


FH M? +2M(k — k') — 2kk' (1 — cos 0). 


Elastic scattering X = p is a special case with W? = M? and Q? = 2Mv. The next 
hadronic threshold is for a nucleon plus one pion, i.e., X = p+7° or n+r*, corresponding to 
W? > (M+m,)?. Still larger W? can involve more complicated many-particle states, which 
are summed over. The three independent kinematic variables are (s, Q?, v), or equivalently 
(k, k’,@) (only two are independent in the special case of elastic scattering). However, the 
hadronic part of the process can only depend on the two Lorentz invariants Q? and v. 
For a fixed initial energy k, the variables Q? and v (and therefore W?) can be varied and 
determined from k’ and 8. 
It is convenient to define dimensionless variables 
Q? V Ex = Ey k— k' 


r= y= oan (5.43) 


x is defined kinematically. However, in the SPM z will be interpreted as the fraction of the 
proton momentum carried by the scattered parton.!? y is the fraction of the e~ energy in 
the lab frame that is transferred to the hadrons. Their ranges are 


1 


te k>M 


(5.44) 


The relation of x, W?, and @ to v and Q? is shown in Figure 5.12. 


10Rarly papers often used the variable w = 1/c. 
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y= 1 
Q? 

2 = 0:5 
x = 0.25 


Figure 5.12 Kinematic variables for deep inelastic scattering for fixed initial energy 
Ex ~ k > me. The horizontal and vertical axes are, respectively, 2Mv = 2My/k 
and Q? = 2Mxy/k, each running from 0 to 2Mk. The sloping solid lines are for 
fixed 0 < x = Q? /2Mv < 1. The dashed lines are for fixed hadronic invariant mass- 
square W? = M? +2Mv — Q?, where W? = M? along the x = 1 line, increasing as 
one moves down and to the right. The dotted lines are for fixed laboratory scattering 
angle 0, with 03 > 02 > 6. 


5.5.2 The Cross Section and Structure Functions 


First consider elastic scattering e~ p > e` p for a hypothetical point-like proton. The spin- 
averaged cross section from the first diagram in Figure 5.13 (with a point vertex) is 


(Qr)454(k! +p! — k- p) dk’ dy! et eT 6 45) 
4Mk (27)32Ep (2m)32Ey gt © PHY? 


do = 
where q = k — k' = p' — p, and the traces are collected in the leptonic and hadronic tensors 


1 2 
L = 3 CKE me CK me] = 2 (ee eRe + gee 


2 
j 9 (5.46) 
q 
Lowy = z [u ( Ø + My (Ø' + M)| =2 [ovvi + DyPv F owg] . 
The cross section can be rewritten 
d 2k! 
7 eE LY’ Wav, (5.47) 


where dQ = d cos édy, and 


dp / (27 )46t( (p' —p— 
mesz roe J= on D (psi, (OP's 0's!Jqu(O)Ips)- (5-48) 


82E py 
The current in (5.48) is the proton part of the electromagnetic current 


JE = Ppr Vp = JÈ, (5.49) 
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Figure 5.13 Left: elastic scattering e7 p — e`p. The shaded circle represents the 
effects of the strong interactions. Right: deep inelastic scattering e7 p > e7 X. 


with 
(p's'|JG(0)|ps) = a(p', s')y”u(p, 5). (5.50) 
Of course, 
(ps| JE lp's’) = (p's' | Jb |ps)*. (5.51) 


JE is actually Hermitian, but it is useful to write (5.48) and (5.50) in a more general form 
for a later extension to the weak interactions. The tensor W,,, contains all of the information 
about the hadrons. 

The cross section expression in (5.47) can be immediately extended to elastic scattering 
from a physical (strongly-interacting) proton, provided one replaces (5.50) by 


(p's'|J6(0)|ps) > u(p’, s‘)PGu(p, s), (5.52) 


where Tó (p',p) is the vertex function that includes strong corrections. As discussed in 
Section 2.12.4, the combination of Lorentz covariance, electromagnetic current conservation, 
and the observed reflection invariance of the strong interactions restricts T6 (p', p) to the 
form 

io™’qp 

2M 

where F? (q°) are form factors that can depend on q? with F?(0) = 1 and F} (0) = kp, 
where Kp ~ 1.79 is the anomalous magnetic moment of the proton. For elastic scattering k’ 
and cos @ are not independent, but are related by (2.392) on page 77, i.e., 


TQ = 7 FRG) + F3(q"), (5.53) 


i 1 
k 1+ Ẹ(1-—cos0)’ 


(5.54) 


(enforced by an energy-conserving delta function in Wy), with the correspondence of no- 
tation (k1, k2) > (k, k’), 0r > 0, and mp > M. From (5.47) and (5.53) one obtains the 
Rosenbluth cross section formula for elastic scattering given in (2.400). 

Expression (5.47) continues to hold for the inelastic case provided the hadronic tensor 
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is redefined as 
AT DESI I a Nemts(+a- Doh) 
pu TH 2r)?2Ep, 4 tn 
x venient (5.55) 


where Xy is an N particle state that may contain both fermions and bosons. In the inelastic 
case k’ and cos @ are independent variables related to the invariant mass W? by (5.42). For 
unpolarized protons the tensor W,,, can only depend on the four-vectors p and q (it can 
also depend on the spin vector s in the polarized case). The only tensors one can construct 
are 


symmetric: PuPv; QWs Pul + quPv, pw 


5.56 
antisymmetric: Pudv — 4uPv, Euvpo P’, ( ) 


each of which can be multiplied by a function of the Lorentz invariants Q? and v. One can 
use the reflection invariance of the strong interactions to show that the €,,,,. term is absent. 
In any case, the leptonic tensor L#” is symmetric,'! so one can keep just the symmetric part 
of Wu. Furthermore, the electromagnetic current is conserved, 0“Jg,, = 0, which implies 


q Wu = 0, q” py = 0. (5.57) 


Thus, only two linear combinations survive, and the most general form is 
WW 1 pq pq 
Wuv = [one + S| Wi (Q80) + gpa [ou Paan] [oe — Patan] m, 658) 


with Q? = —q? > 0. The real Lorentz invariant functions W1,2(Q?, v) are known as the 
proton structure functions. They generalize the form factors F} 2(q°) of elastic scattering, 
and contain all of the information about the strong interactions effects. One can combine 
(5.47) and (5.58) to obtain 


da _ a? 
dk'dQ 4k? sin4 


0 0 
- [w (Q?, v) cos? z 2M (Q?, v) sin? | (5.59) 
2 
for the deep inelastic cross section in the proton rest frame. The solid angle element is 
usually integrated over the azimuthal angle, dQ = dpd cos @ —> 27d cos 6, and the kinematic 
variables can be rewritten 


a 
dk'dcos 0 = Ad ——dx : 
cos ow dQ*d dy (5.60) 
to yield 
da do nMa?y 0 0 
— = 2Mk? = ? 2-42 ? v)sin? =|. (5.61 
dedy YaQ?dv ~ Wek’ sint? |"? (cor eWay je een) 


11 Both the leptonic and hadronic tensors have €pvpo terms if there is polarization, or for parity-violating 
weak processes such as vN + vX and vN —> uX due to the interference between the vector and axial 
currents. 
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W, and W, can be determined separately from the data by varying 6,k, and k’ for fixed 
Q? and v. 
An alternative notation is to use the variables x and Q? and to define 


F, (x, Q*) = MW, (Q?,v), Fp (2, Q?) = vW (Q?,v). (5.62) 


This is useful because the simple parton model (QCD) predicts that F; is independent of 
(slowly varying with) Q? for Q? large. Then, 


dö 8ra? Mk 5 Mery x 
inj = [r (z,Q?) (1 ssp 2) + 20 Fy («,Q?) | (5.63) 
8ra? Mk 1+(1-y)? MayF» 
ae a 2er; (5 + (Fy — 20F\)(1-y) - I. 


In the second form, the last term vanishes for k >> M, while the middle term vanishes in 
the simple quark parton model. 

The deep inelastic limit is defined as Q?, > œœ with x = Q?/2Mv fixed. If the proton 
were an extended fuzzy object, one would expect F;(x,Q?) — 0 in this limit. However, the 
MIT-SLAC experiments (Friedman and Kendall, 1972; Mishra and Sciulli, 1989) circa 1970 
showed instead that 

F;(z, Q?) Ee, F;(x) £ 0, (5.64) 
=> o0 


a property known as Bjorken scaling (Bjorken, 1969). This scaling can be understood in the 
Feynman parton model (e.g., Field and Feynman, 1977; Drell et al., 1969), in which the pro- 
ton is made up of hard point-like parton constituents. The observed y distribution showed 
that the partons have spin-4, consistent with QCD and asymptotic freedom. Subsequent 
experiments established that the scaling is only approximate, and in fact the slow (loga- 
rithmic) variation of the F; with Q? is what one expects from the higher-order corrections 


in QCD.” 


5.5.3. The Simple Quark Parton Model (SPM) 


The cross section for elastic scattering e~ p — e~ p from a point proton is given in (2.391) on 
page 77, with the kinematic constraint for k’ given in (5.54) (see the subsequent comment 
on notation). It is convenient to rewrite (2.391) as 


do a? T 20 Q? 8 
dQ? 4k? sin? $ kk! feos 2° 2M2” 3 i (5.65) 


where we have used dQ? = 2k’?dcos@, which follows from (5.41) and (5.54). For elastic 
scattering, Q? = 2Mv from (5.42), so (5.65) can also be written as 


da a? T 2 (4) Q? = 0 Q? 
dQ? dv 4k? sinf { kk’ feos 2 2M2 es 2 oye 2M )° (5.66) 


12Scaling also breaks down at low Q?, where strong coupling effects are important. In particular, hadronic 
resonances in the yp channel for fixed W? are important. The scaling behavior of the structure functions 
smoothly interpolates these resonances (Bloom and Gilman, 1971; Melnitchouk et al., 2005), as can be 
understood from finite energy sum rules (FESR), derivable from analyticity. 
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This is a special case of the general formula (5.61) provided we identify the structure 


functions as 
2 2 2 s 
ex O @\_ Q Q 
Wi(Q",v) = Imz? ( S7) ~ 4M2p i (1 iv) 


Q? 1 Q? 
W2(Q?,v) =6 =— (1 . 
2(Q",¥) ( 2M) v 2Mv 
Now assume that the proton is a bound state of point-like quarks. Consider the contribution 
to the structure functions from an individual quark q; with electric charge e;, which carries 


four-momentum 2;p, where p is the proton momentum. It is plausible that (5.67) applies 
for that quark, provided one replaces M —> x;M and multiplies by eĉ, i.e., 


e7Q? Q? eti Q? 
mEn aila) ME a (2-27). 6.09) 


(5.67) 


(This result can be better derived in the infinite momentum frame, where the proton and 
quark masses and the transverse momentum of the proton relative to the electron direction 
are negligible.) The contribution of that quark to the structure functions is 


Fi (x, Q’) = MWj(Q?,v) = 56:8 (e: = an) 
(5.69) 


Í 7 Q? 
Fi(x, Q?) = vWi(Q?, v) = aid (a - ) l 


To find F; a(x, Q?) one must sum over the quark types and integrate over their possible 
momenta. This is done in the cross section (i.e., in F1,2) because the different i and z; lead to 
different (incoherent) final states. Introduce the parton distribution function (PDF) qi(xi) 
as the probability density (the absolute square of the momentum space wave function) for 
finding quark q; in the proton with momentum fraction xi. Then from (5.69) the predicted 
structure functions are 


Fy (£,Q?) = 2a (x, Q?) 


1 
P (0) 


The simple parton model therefore predicts that the F; are independent of Q? for large 
Q? (Bjorken scaling). The quantity xq;(x) is interpreted as the momentum distribution for 
parton qi. 

The predicted relation F> = 2xF;, the Callan-Gross relation (Callan and Gross, 1969), 
is a signature of spin-4 constituents. Scattering from spin-0 constituents would lead to 
Fı = 0, F # 0 and therefore a different angular distribution. An interpretation of this 
result is that one can show (see, e.g., Renton, 1990) that 


Q p 5.70 
saa) 7 de ae om 


OL Fy — 2x Fi 
OT 22 Fy , 


R(z, Q?) = (5.71) 
where oy and or are, respectively, the total cross sections for y pp, where yj 7 is a virtual 
photon with momentum q, and L and T refer to longitudinal (photon helicity 0) and trans- 
verse (photon helicities +1) polarizations. In the Breit frame (discussed in Section 2.3.4), 
the virtual photon has four-momentum q = (0,0,0,—@Q), while the incident [final] parton 
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has momentum $(Q,0,0,Q) [$(Q, 0,0, —Q)], as in Figure 2.4. For spin-0 partons one would 
have ør = 0 (R = œœ) by angular momentum conservation, since there is no orbital angular 
momentum along the direction of the photon and parton momenta. For spin-} partons, on 
the other hand, oc, = R = 0 using the fact that helicity is conserved for vector transitions 
of massless spin-5 particles, e.g., (2.214) on page 42. 

Even to the extent that the SPM is valid, a real proton is expected to consist of not 
only the three valence quarks uud of the quark model, but also a sea of qq pairs and of 
gluons produced by soft (low momentum) processes such as illustrated in Figure 5.14. The 


q q 

mG én} 

p= 89999 P E 4 
ee — q A 
q q 


Figure 5.14 Higher-order soft processes that produce a sea of gluons and qq pairs. 


anitiquarks contribute to the F; with the same formula, except qi(x) > qi(x), 
F, (a, Q") = 22F; (x,Q°) = y e} x(qi(x) + Gi(z)). (5.72) 


The gluons do not contribute directly to electromagnetic processes (they do contribute to 
purely hadronic short distance processes). It is convenient to write 


qi(x) = qvi(x) + asi(z), G(x) = Gs: (x), (5.73) 


where qy; and qs; represent the valence and sea quarks, respectively. The quark distribution 
functions are determined by long distance effects, and at present there is no way to reli- 
ably calculate them. However, there are a number of plausible constraints and consistency 
conditions. One expects 

qsi(x) ~ Gsi(z), (5.74) 
as is suggested by the diagrams in Figure 5.14. However, this is not a rigorous result and 
there could be small deviations. A more precise result is 


| dz (qsi(x) — Ysi(x)] = 0. (5.75) 


This follows from the meaning of valence and sea quarks, and should hold to the extent that 
the SPM is valid. Similarly, the proton should have two valence u quarks and one valence 
d 


f ae =o. f besi (5.76) 


which implies 


(5.77) 
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These predictions are difficult to measure, but existing data is consistent. Because of quark 
mass effects one expects b < Z < 5 < ŭ, d, and for the kinematic region relevant to the MIT- 
SLAC experiments it is reasonable to neglect the b, b, c, and Z. Since the u and d quark masses 
are so small compared to A it is a reasonable first approximation to expect u(x) ~ d(x). This 
is not a rigorous consequence of isospin since the proton is not an isosinglet, but should hold 
approximately because of the isospin invariance of the gluon coupling. Another constraint 
is that 


ye f satanda f pade = 1, (5.78) 


where G(x) is the gluon probability distribution, which can be probed in hadronic processes. 
Equation (5.78) states that the total momenta of all of the constituents must add up to the 
proton momentum. 

From (5.72), one predicts 


z (u(x) + u(ax)] + =x [d(x) + d(x)| + —2 [s(x) + 3(£)]. (5.79) 


One cannot distinguish the various distribution functions using e~ p + e~ X data alone, but 
separation between the quarks and antiquarks and between flavors can be accomplished by 
considering scattering from neutrons and other deep inelastic processes, such as 


Typ —+ wtX and etp > BX, (5.80) 


as will be described in Chapter 8. One finds that the momentum fractions of the proton 
constituents are of order 


J xu(x)dz ~ 0.30, I ad(x)dx ~ 0.15, / zG(x)dz ~ 0.50. (5.81) 


The momenta carried by the u + d is about 10% that of the u and d, while the s + 5 
momentum fraction is about half of that. These rough estimates apply either to SPM 
analyses of relatively low Q? data (e.g., Field and Feynman, 1977), or as the values at a low 
initial Q? (e.g., 5 GeV?) in the more sophisticated QCD improved model described below. 
The actual momentum distributions obtained in a recent analysis are shown in Figure 5.15. 

The quark distributions and structure functions defined above referred to the proton, 
and it is sometimes useful to denote them as W?, FP, q? to emphasize that fact. One can 
define analogous quantities for a neutron target, such as q7}. The SPM prediction is 


F} (x,Q?°) = 22F? (x,Q?°) = 5, e? x (q? (x) + F(a). (5.82) 


The neutron distribution functions are not independent, but are related to those of the 
proton by isospin, i.e., 

u” (x) = d(x), d(x) = uP(x), s” (a) = s” (x) (5.83) 
up to possible small isospin-breaking corrections. Similar results hold for the antiquarks. It 
is also useful to define isospin-averaged distribution functions 
S 1 n 1 

(x) = 5 W” (2) + u” (x)] = 5 lu” (2) + (2) 
1 1 
d? (x) = z P(e) + d” (2)] = 5 [d (2) + u” (2)] = u(x), 


u 


(5.84) 
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analogous to (2.402). These are the distribution functions per nucleon for nuclei with equal 
numbers of protons and neutrons such as 1°C or (up to a correction) °° Fe. For such targets, 


FS (x, Q?) = 2aF 3 («, Q) = 2 e? x [gf (x) +a (2)]. (5.85) 


Most experiments are done on heavier nuclei because higher statistics can be achieved, even 
though information on the isospin structure is lost. 

The simple quark parton model was important because it approximately described the 
observed properties of deep inelastic scattering at moderate Q?. The approximate scaling 
established the existence of point-like (i.e., asymptotically free) constituents of the nucleon, 
and the relation F(x) ~ 2xF\(«) indicated that the constituents have spin-}. The mea- 
sured distribution functions q(x) give more detailed information about the distribution of 
the quarks within the nucleon. Further implications of deep inelastic scattering processes 
involving W= and Z exchange will be described in Chapter 8. We now turn, however, to 
the corrections to the SPM as predicted by QCD. 
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Figure 5.15 Momentum distributions æ f(x) of the gluon (g), valence quarks (uy, dy), 
and sea quarks (u,d,s,c,b) from a recent global analysis (Ball et al., 2015), at u? 
(i.e., Q?) = 10 GeV? (left), and u? = 104 GeV? (right). The heavy quarks become 
more important for large Q? and small z, where mass effects are less important. 
Plot courtesy of the Particle Data Group (Patrignani, 2016). 


5.5.4 Corrections to the Simple Parton Model 


The scattering from non-interacting point-like quarks in the SPM leads to Q?-independent 
structure functions F} 2(x, Q?) > F,2(x). However, the asymptotic freedom of QCD pre- 
dicts that a;(Q?) is small but nonzero at the observed scales, so one expects higher-order 
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Figure 5.16 Lowest order QCD corrections to the simple parton model. 


corrections to scaling and the SPM from diagrams such as those in Figure 5.16. These lead 
to logarithmic Q? effects that can be interpreted as an effective Q? dependence of the quark 
and gluon distribution functions, in what is known as the QCD-improved parton model. 

A heuristic picture is that at moderate Q? (e.g., 10 GeV?) there are relatively few gluons 
or sea quarks, at least in the region x > 0.2. At higher Q?, on the other hand, the virtual 
photon or other probe can resolve more qq pairs and gluons associated with the splittings in 
Figures 5.14 and 5.16. One therefore expects a somewhat reduced momentum distribution 
for the valence quarks, and enhanced sea quark and gluon distributions at low x. This indeed 
is the case, as can be seen in Figure 5.15. The most dramatic effect is for x < 0.05, which 
has especially been studied at the HERA ep collider at DESY. The observed dependence of 
F(x, Q?) for fixed x and varying Q? is shown in Figure 5.17. 

More precisely, the QCD corrections are dominated by gluon emissions and other cor- 
rections involving small transverse momenta kr. These cannot be calculated completely in 
perturbation theory because of collinear singularities (e.g., Ellis et al., 2003; Salam, 2010a). 
Rather, those with kr smaller than some factorization scale up are absorbed into the dis- 
tribution functions,!? which therefore depend on pp in a calculable way, while those with 
larger kr are treated explicitly. wr is usually taken to be Q for deep inelastic scattering. 

The distribution functions f;(x, Q?) where fi = qi,qi,G therefore depend both on x 
and Q? in a complicated way. Fortunately, one can parametrize them and test QCD by 
a two-step process. First, f;(a,Q?) can be measured at some convenient reference scale 
Qg, thus determining the long distance effects that cannot be calculated perturbatively 
or by other presently available techniques.!4 Then, the In Q? evolution of the f;(x, Q?) to 
larger Q? values can be predicted from QCD and compared with the experimental data. In 
practice, one generally assumes (physically motivated) analytic expressions for the f;(x, Q3) 
in terms of several unknown parameters, and then determines those parameters as well as 
as(Q?) (i.e., A) by a fit to the data at all Q?. The evolution actually depends on all of the 
distribution functions, including the gluon’s. Therefore, modern fits often make use of all 
relevant data from the various deep inelastic and other short distance processes. 

It is not sufficient to consider only the lowest order diagrams in Figure 5.16. These scale 
as as(Q?) In Q?, which is not small. It is possible to sum the leading order (LO) contri- 
butions of O[(as ln Q?)”] for all n by integrating the Dokshitzer-Gribov-Lipatov-Altarelli- 


13This is an infrared analog of renormalization theory for ultraviolet divergences. 
14Jn principle, the f; (x, Q3) could be calculated by lattice techniques, but this would be difficult. 
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Figure 5.17 Reduced cross sections for e*p + e*X as a function of Q? for various 


fixed x as measured at HERA. These are essentially Fo (x, Q?) except at the high- 
est Q?, where the effects of Z exchange (which distinguish e~ [upper curve] from 
e*t [lower]) become significant. The lines are the result of a QCD fit. Reprinted 
from (Abramowicz et al., 2015), with kind permission of The European Physical 


Journal (EPJ). 
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Parisi (DGLAP), or just Altarelli-Parisi, equations (see, e.g., Altarelli and Parisi, 1977) 


dG(x,Q?) z as (Q?) H dw Da Ja ( (w Q *)+Pae (<) G (w,Q") 


ae ae (5.86) 
se [ [Poa (=) a: (wQ?) + Pra (=) G (w,@)] 


where q; includes both q; and q;. In (5.86), the Pr, ¢,(z) are the splitting functions that 
describe the probability for parton f; to emit parton fj carrying a fraction z < 1 of the fi 
momentum. Their explicit forms are given in, e.g., (Ellis et al., 2003). The splitting functions 
are perturbative and can be expanded in a,. The zeroth order term (one factor a, is already 
extracted in (5.86)) yields the LO approximation. The next to leading order (NLO) terms, of 
order Ola” (In Q?)"~+], involve the O(a) corrections to the splitting functions. The splitting 
functions have been calculated to NNLO (for the original references, see Patrignani, 2016), 
which are used in modern analyses.!° 

It is possible to integrate the DGLAP equations numerically. One can also consider the 
moments (Mellin transformations) of the distribution functions, 


1 
fi(N, Q?) = f rN! f(x, Q?)dx. (5.87) 


The moments of the flavor non-singlet (ns) distribution functions (such as qi — G or qi — 
qj) satisfy simple first order equations, which are easily integrated. In LO the non-singlet 
moments are predicted to scale as 


fns( N,Q?) = fna(N, Q2) (23) | (5.88) 


where the anomalous dimension yn is a known function of the number of light flavors, 
calculable from P}q and as. Similarly, the moments of the G and singlet (i.e., X; (qi + &)) 
distribution functions satisfy coupled differential equations. After solving, the distribution 
functions can be recovered from the inverse Mellin transformation. The non-singlet moment 
evolutions in principle provide very clean tests of QCD and determinations of a,, since 
they are independent of the gluon distribution, but in practice the tests are limited by 
uncertainties in the distribution functions. 

There are many complications to the description of deep inelastic scattering. For very 
small x, as studied at HERA, there are important In(1/2) contributions to the splitting 
functions, which must be summed. Also, at moderate Q? values there are mass and higher 
twist effects, due to diagrams such as gluon exchange between the scattered and unscattered 
quarks, which are of order (1/Q?)",n > 1. These are difficult to calculate but can be 
parametrized. Another complication is that R(x, Q?), defined in (5.71), does not really 
vanish, due to QCD corrections and effects neglected in the SPM, such as quark mass and 
transverse momentum within the proton. R is difficult to measure precisely, but is consistent 
with the expected values < 0.1. Nuclear effects for heavy targets can also be important (e.g., 
Armesto, 2006). 

Detailed global analyses of the parton distribution functions (PDFs) at LO, NLO, 
and NNLO have been carried out by a number of groups, including ABM (Alekhin 


15Other quantities, such as as (pl?) and the parton-parton cross sections for short-distance hadronic pro- 
cesses must be calculated to the same order. 
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et al., 2014), JR (Jimenez-Delgado and Reya, 2014), NNPDF (Ball et al., 2015), 
MSTW/MMHT (Harland-Lang et al., 2015), HERAPDF (Abramowicz et al., 2015), and 
CTEQ/CT (Dulat et al., 2016), with the PDFs available from lhapdf .hepforge.org. Im- 
pressive agreement between the data and the QCD predictions was obtained. 


5.6 OTHER SHORT DISTANCE PROCESSES 


The formalism for deep inelastic scattering is easily extended to allow for polarized beams 
and/or targets (e.g., Patrignani, 2016). Experiments measuring elastic and inelastic electron 
or muon asymmetries with polarized beams have been carried out at SLAC, Jefferson Lab, 
Bates, and Mainz; experiments involving both polarized leptons and polarized nucleons have 
been done at DESY, CERN, SLAC and Jefferson Lab; and those with polarized pp scattering 
at RHIC. These involve not only photon but also Z exchange, and allow tests of the elec- 
troweak theory and detailed studies of the nucleon, such as its spin distribution (Burkardt 
et al., 2010; Aidala et al., 2013; Leader and Lorcé, 2014). Surprisingly, only about 1/3 of 
the nucleon spin appears to be due to the quark spins, with the remainder presumably from 
gluons and orbital angular momenta. They also yield information on the strange quark 
content and matrix elements of the proton (Kaplan and Manohar, 1988; Armstrong and 
McKeown, 2012), which are relevant to the spin question, the interpretation of dark matter 
experiments (Section 10.1.2), precision experiments, and the o term (Problem 5.13). 

One can also introduce structure functions for the photon. These are useful when a pho- 
ton turns into a virtual gg pair, and a subsequent interaction is more associated with these 
hadronic constitutents than with the direct photon couplings.'® (One says that the parton 
content of the photon has been resolved, and the structure function describes the parton 
distributions.) The photon structure functions can be measured in the deep-inelastic scat- 
tering of, e.g., an electron on a quasi-real (i.e., nearly on shell) photon emitted by a positron 
in e~e* scattering. This can occur by the exchange of a second (highly virtual) photon, 
similar to the right-hand diagram in Figure 5.16 with the gluon replaced by the quasi-real 
photon and the proton (and unscattered quarks) replaced by a positron. The overall cross 
section can be given in terms of the photon structure functions and the effective flux of these 
quasi-real photons (in the equivalent photon approximation). The photon structure has also 
been studied in e~ p scattering at HERA, where in appropropriate kinematic regimes the 
constitutents of a quasi-real photon emitted by the e~ scatter from the proton constituents, 
similar to the short distance hadron-hadron interactions to be considered in Chapter 6. The 
formalism, measurements, and QCD evolution is reviewed in Nisius (2000). 

PDFs can be thought of as probability distributions in momentum space and expressed 
as diagonal matrix elements of quark or gluon bilinear operators. They can be extended to 
generalized parton distributions. These are off-diagonal matrix elements!’ that carry more 
information than the PDFs, i.e., the full three-dimensional momentum and spin distribu- 
tions of the partons within the nucleon. They can be probed experimentally in a variety of 
exclusive processes (e.g., Diehl, 2003; Belitsky and Radyushkin, 2005). 

Many processes involving short-distance scatterings of the constituent quarks and gluons 
are observed at hadron colliders. These are essential for testing QCD and are both the 


16Similar considerations apply in other regimes. For example, the electromagnetic form factors of hadrons 
are dominated in the timelike region by vector mesons with the quantum numbers of the photon, i.e., 
the p, w, and ¢, as in Equation (2.387) on page 76. The vector meson dominance (VMD) model (e.g., 
Schildknecht, 2006) further assumed that photoproduction (yN scattering) and other interactions of the 
photon with hadrons are dominated by their vector meson components. 

17They are analogous to the density matrix in quantum mechanics, which carries more information than 
the modulus of the wave function. 
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backgrounds and possible signals in searches for new physics. These will be discussed in 
some detail in Chapter 6. 

Other powerful techniques allow separation of perturbative and non-perturbative effects, 
e.g., in the decays of mesons involving a single heavy quark, such as the B+ = ub. In 
particular, heavy quark effective theory (HQET) (Isgur and Wise, 1989; Bigi et al., 1997; 
Neubert, 1994) and soft collinear effective theory (SCET) (Neubert, 2005; Becher et al., 
2015) are, respectively, based on systematic expansions in Agcp/mg and Agcp/E, where 
mg and E are large quark masses and large energies of decay products. Non-relativistic 
QCD (NRQCD) (Bodwin et al., 1995) is useful for describing states with two heavy quarks, 
such as upsilonium (bb). 


5.7__ THE STRONG INTERACTIONS AT LONG DISTANCES 


At low energies or long distances a, — O(1) so one cannot use perturbative techniques. 
Quarks and gluons are confined, and hadrons become the basic degrees of freedom. Never- 
theless, there are a number of tools available for understanding the strong interactions. One 
class involves phenomenological models, which typically are useful in limited domains of 
kinematics and parameters. These include S-matrix theory and models, such as dispersion 
relations, the Veneziano (dual resonance) model, Regge theory and the pomeron!® (Collins, 
1977; Donnachie and Landshoff, 2013); the MIT bag model (Chodos et al., 1974); and one 
boson exchange potentials in nuclear physics (Epelbaum et al., 2009). Another approach, 
which we touched on in Chapter 3 and will elaborate on in the next section, involves the 
chiral flavor symmetries of the strong interactions, including related techniques such as 
current algebra (Adler and Dashen, 1968; D’Alfaro et al., 1973; Coleman, 1985; Weinberg, 
2009); chiral perturbation theory (e.g., Pich, 1995; Bernard, 2008; Cirigliano et al., 2012; 
Bijnens and Ecker, 2014); QCD sum rules (Colangelo and Khodjamirian, 2000); the 1/N. 
expansion (’t Hooft, 1974); the Skyrme model (Skyrme, 1962; Adkins et al., 1983; Zahed 
and Brown, 1986); and the OZI (Zweig) rule (Lipkin, 1984). 

Lattice QCD (e.g., Gupta, 1997; DeGrand and DeTar, 2006; Kronfeld, 2012; Patrignani, 
2016) is a non-perturbative definition of QCD. It is based on approximating space and 
time by a discrete lattice of points, allowing the QCD equations to be solved numerically, 
typically on a supercomputer. Most recent calculations have been able to include dynamical 
fermions (i.e., in loops), a significant improvement over earlier (quenched approximation) 
calculations. Lattice QCD has had considerable success in computing the hadron (Davies 
et al., 2004; Dürr et al., 2008) and gluonium spectra, demonstrating quark confinement, 
calculating form factors necessary for the weak interactions (Aoki et al., 2017), and in 
computing finite temperature effects (e.g., DeTar and Heller, 2009). 


Confinement 


It is believed that under ordinary conditions color is confined, preventing the production of 
isolated quarks and gluons. First consider QED. A well-separated e~ and e* act as sources of 
a classical electromagnetic field. The photons have no self-interactions, and the non-linear 
effects due to QED are small. The electromagnetic field therefore spreads out in space, 
leading to the familiar V(r) ~ a/r potential between the charged particles. Given sufficient 
energy they can get arbitrarily far apart, so there is no confinement. The qualitative picture 


18Pomeron exchange can be roughly thought of as the exchange of two gluons in a color-singlet state. 
See, e.g., (Donnachie et al., 2002; Domokos et al., 2009) for a more detailed discussion of the connection to 
QCD and string theory. 
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for QCD, which is supported by lattice calculations, is very different. When a quark and 
antiquark get far enough apart the classical gluon field they produce forms into a long 
narrow flux tube due to the strong coupling and gluon self-interactions. The energy stored 
in such a flux tube is proportional to its length, so that there is an effective potential between 
the q and g which grows linearly with their separation, V(r) ~ «r. It would take infinite 
energy to separate them, so they are confined. Actually, once the q and g are far enough 
apart so that kr = 2m, the tube can create a gq pair and break, leading to two mesons 
rather than isolated quarks. This is analogous to cutting a bar magnet, which creates two 
smaller bar magnets rather than a monopole-antimonopole pair. 

The linear gg potential can be combined with a one-gluon exchange short-distance term, 


4 as 
V(r) = = + Kr, (5.89) 
in a non-relativistic model!® for heavy meson bound states, such as charmonium (cĉ) and 
bottomonium (bb) (e.g., Eichten et al., 2008). The 4/3 is appropriate for color-singlets, and 
a is evaluated at the gq mass scale. Fits to the spectrum and decays such as J/q > ete7 
are reasonably successful and obtain « ~ 1 GeV/fm ~ 0.2 GeV?. See Problems 5.9-5.11. 


5.8 THE SYMMETRIES OF QCD 


QCD contains a number of accidental global symmetries in addition to the gauged color 
SU(3)-. (Weinberg, 1973a,b), where the subscript c identifies SU(3). as the group associ- 
ated with color. They are referred to as accidental because no terms consistent with gauge 
invariance and renormalizability can be written to violate them without introducing extra 
fields. For example, for 6gcp = 0 the Lagrangian density in (5.2) is automatically invari- 
ant under the discrete space reflection (P), charge conjugation (C), and time reversal (T) 
transformations.7° 


5.8.1 Continuous Flavor Symmetries 


The gauge interaction terms in £gcp for ng quark flavors are invariant under an SU(nq)1 X 
SU(nq)r x U(1)g x U(1)4 global chiral flavor symmetry that commutes with SU (3)e. The 
quark mass terms break the chiral symmetries, and the non-chiral subgroup if they are 
not degenerate. Meb are all large compared to the QCD scale A so the flavor symmetry 
associated with them is too badly broken to be relevant. Therefore, we will focus on the 
approximate SU (3)z x SU(3)r x U(1)g x U(1)4 symmetry of the u,d, and s quarks, and, 
since ms S A, on its SU (2)z x SU(2)r x U(1)g x U(1)4 subgroup, extending the discussion 
in Chapter 3. The relevant quark part of the Lagrangian density is 

La = GP —m)q= qr iPa + GriPar — (emgr + Grmqz) , (5.90) 
where q = (u ds)’ represent the quark fields, m = diag(m, ma ms) is the diagonal quark 
mass matrix, and the color indices are suppressed. m can be written as 


m = mol + mgr? + mgA’, (5.91) 


19Other potentials, such as V(r) = Alnr, which roughly interpolates between the terms in (5.89), can 
also be used. 

20Parity violation in strong processes can be induced by weak interaction perturbations (W+ exchange). 
In a general field theory these could lead to relatively large O(a) effects in addition to the safer O(a/M2,) 
ones. However, for QCD and similar theories, these only affect quark masses and can be absorbed into their 
renormalization (Weinberg, 1973b). They could in principle be problematic for scalar-mediated theories, 
including supersymmetric QCD, but there they are suppressed by large superpartner masses. 
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using the notation of (3.113) and (3.114) on page 110. 
The two U(1) factors have generators 


1 1 
B= 5 [oez Bas 3 [oez (5.92) 
where J®:4 are the associated Noether currents 
Nia > 5 Tr Yudra» Je = 5 Gy dras (5.93) 


with r and a the flavor and color indices, respectively. B is just baryon number (4 for quarks 


and -4 for antiquarks), while B4 is the analogous axial baryon number. They are related 
to the chiral U (1) generators Br pr (i.e., Br = +3 for qL (qL) and 0 for qr(qr), and the 
reverse for Br) by B(Ba) = Br + Bz. The baryon number is an accidental symmetry that 
is conserved to all orders in perturbation theory in the standard model, though there may 
be a small non-perturbative breaking in the electroweak sector due to vacuum tunneling 
(instanton) effects (e.g., °t Hooft, 1976b; Schafer and Shuryak, 1998). Many extensions of 
the SM predict B violation at some very small level, which would lead to proton decay. 
However, there is no sign of the axial baryon number symmetry in nature, and in fact the 
pseudoscalar spectrum implies that it must be absent, as will be discussed in Section 5.8.3. 
It is now believed to be broken by non-perturbative effects associated with the triangle 
anomaly in the B4-gluon-gluon vertex and with instantons. 

Consider SU (3)z x SU (3)r or its SU(2); x SU (2)r subgroup. Define the generators of 
SU(3)z,R as F} p, respectively. Their Lie algebra is 


[Fi Fi] = ifignk®, [Fh Fè] = ifik FE, [Fi F$] =0, (5.94) 
and their commutators with the chiral quark fields are 
[pi Nib i 
[F}, qar] = — 5 DL» [F], qar] =0 
X (5.95) 
[FR qar] _ 0, [Fr qar] E F (bR, 


i.e., qz transforms as (3,1), a triplet under SU (3)z and a singlet under SU (3)pr. Similarly, 
qr transforms as (1,3), and the antiquarks as (3*, 1) and (1,3*). F$ g are associated with 
the Noether currents 
i -A si a Aa i PRE 
Jip = Uu ZU E Lug bL Try = RW AR- (5.96) 


It is useful to also define the vector and axial vector generators 
F° = F$ + F} = I Ve (x) dz, Fe = Fi Fi = / Ai (x) Bz (5.97) 
and their associated currents 
i i i _ x i i i _ 5X 
Vi = Jh + Jip = ugt An, = IRyp ~ Jip = Mu yt (5.98) 


The vV are associated with the weak and electromagnetic currents, while the At, enter the 
weak interactions. They do not commute, but rather satisfy 


[FŻ, FI] = ifij F", PP) = ifij F, A FI] Sa a Rh, (5.99) 
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The vector F’ generate the diagonal SU(3) subgroup of SU(3) x SU(3). 
The vector and axial currents Vi and At, are conserved for massless quarks. Turning 
on m = diag(my, Ma ms) # 0, one obtains from (3.51) on page 99 (or directly from the 


equations of motion) that 


7 z At . oN nN 
Ə" V, =—iq [Zl q, OMA, = 1g {Fim} y5q. (5.100) 
As expected, the axial symmetries are broken by the quark masses, even if they are degen- 
erate, while the vector symmetries are only broken by mass splittings. 


5.8.2 The (3*,3) + (3, 3*) Model 


The (3*,3) + (3,3*) model of SU (3)z x SU(3)r symmetry realization and breaking (Gell- 
Mann et al., 1968; Glashow and Weinberg, 1968) was inferred from the data prior to QCD or 
even the general acceptance of the quark model (Pagels, 1975), but emerges naturally in that 
framework. The basic idea is the assumption that the strong interactions have an SU (3)z x 
SU(3)r invariant part as well as a small breaking term. The observed hadron spectrum 
does not exhibit the full SU (3)z x SU (3)r symmetry,”! so the symmetries associated with 
the axial generators are assumed to be spontaneously broken. The explicit breaking term 
transforms as a singlet and octet under ordinary SU(3), and as (3*,3) + (3,3*) under 
SU (3), x SU (3)r. That is manifestly the case for the quark mass term in (5.90) and (5.91), 
which constitutes a concrete realization of the (3*,3) + (3,3*) model. 

The qualitative picture, which is supported by lattice calculations, is therefore that in 
the limit m, = mq = M, = 0 the symmetries associated with the 8 axial generators F° 
are spontaneously broken. According to the Nambu-Goldstone theorem of Section 3.3.6 
this implies the existence of 8 massless pseudoscalar Goldstone bosons, which are identified 
with the pseudoscalar octet n=}, K+, K°, K°,n. The SU(3)-singlet n’ is not a Goldstone 
boson because of the non-perturbative breaking of the associated B4 symmetry, as will be 
discussed in the next section. Unlike the examples in Chapter 3 the SSB of the F is not 
due to the VEV of an elementary scalar, but rather is associated with the VEV or vacuum 
condensate of a composite gq operator, i.e., in the massless limit 


(Olau|0) = (0|dd|0) = (0|ss|0) = vo = O(A) (5.101) 


due to the non-perturbative long-distance dynamics. Since the chiral symmetry is broken, 
the quarks must acquire a dynamical or constituent common mass Mayn ~ M,/ 3 ~ 300 
MeV proportional to A from the strong dynamics, i.e., from the interaction with the quarks 
and gluons in the condensate. 

To better justify these statements, let us define the composite pseudoscalar and scalar 
field operators 


PS _éM 5 _ At N x 
a te 14L oR + ORS aL 
l l . (5.102) 
eee: o x pas rv 
= F579 = 45 dR + IRJAL 


for i = 0---8, with \° = 3! and dojk = VEJS o° and 7° are SU (3) singlets and the 


21One would expect massless chiral fermions, and each meson isomultiplet would need to have at least 
one degenerate partner with the opposite parity, contrary to observations. 
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others are octets. In particular o? = (wu + dd + 3s)/V/6. It is straightforward to show from 
(5.95) that S — 
[F5 ni] = idije", [FP eo! = iden”. (5.103) 


Now assume the gq condensate in (5.101), which implies that (0|0°|0) = žr. Taking the 
expectation value of the first equation in (5.103), 


(0| [F", 27] |0) = iðijvo. (5.104) 


This implies that F’°|0) 4 0, i.e., the F symmetry is spontaneously broken. It also implies 
that there must be some angular momentum-0 state in the Hilbert space with a non-zero 
inner product with F’°|0), which we assume to be a single particle state that we denote 
|t (7)) under the anticipation that it is a bound state with the quantum numbers of the 7’ 
field. Using (5.97), 


(O|F?|n*(@)) = J ezoa). (5.105) 


But (0|AŻ (x)|z"(g)}) must be of the form 


i k(>\\ _ . fr —iqx 
(lA; (x)| a" (¢)) = iy du dixe oe (5.106) 
The q, follows from Lorentz invariance because q is the only four-vector available, the jx 
is from the unbroken SU(3), and the exp(—iq- x) follows from translation invariance. fr, 
known as the pseudoscalar decay constant, is illustrated diagramatically in Figure 5.18. It 


Pf 
q f aon q 
Bde ~~ i sq fr / 5? Pa NS 
yS an T’ ty Gnd 2 m S } 7 H 
q q2 \ po000" 
Pi 


Figure 5.18 Left: the pseudoscalar decay constant. Middle: pion pole contribution to 
the induced pseudoscalar form factor in the chiral limit. The cross indicates the 
axial current and q = pyr — pi. Right: two gluon intermediate state coupling to 
the 7’. 


is determined from the strong dynamics that lead to the bound state, and is expected to be 
of O(A). We will see in Chapter 7 that fr ~ 130 MeV can be measured from pion decay. 
Inserting (5.106) into (5.105) 


Sr 
V2’ 


where we can take t = 0 since F” generates a symmetry of the equations of motion. One 
can insert a complete set of states between F*° and 7 in (5.104). Assuming that only the 


(O|F?|n*(@)) = (277)? qoidind* (7) (5.107) 
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single particle states enter, or at least that they dominate, we find 


Ëg f T or 7 es 
Fanon (OF la" (@))(a*(@) a? 0) — (Olr |r" (g) a" (@)|F"? 0) } 
/ (21) °2qo (5.108) 


co dn l 
= iby Be = iĝijvo, 


where Z!/?, which has dimensions of mass?, is associated with the bound state wave function 
of the zë by ; 
(Olrta (g) = D255. (5.109) 


Defining v = v)/Z'/?, which has dimensions of mass, one has that 
f, 
V2 , 


which is nonzero if the symmetry is spontaneously broken. The Nambu-Goldstone theorem 
then follows from 0” A’, = 0 and (5.106) 


(5.110) 


(010A Jn*) = 0 = Fides q fr = MÈ fr = 0. (5.111) 


This summarizes the Goldstone alternative: either fy = 0 (no SSB) orm 
a massless spin-0 Goldstone boson). 

To justify the generation of a dynamical mass Mayn # 0 let us first derive the Goldberger- 
Treiman (GT) relation. To simplify the notation, we work with the SU(2); x SU(2)r 
subgroup. Since the quarks do not appear as physical states, consider the nucleon isospin 
doublet y = (Wp Wn)’, as in Section 3.2.3. The matrix elements of the axial currents 
between the nucleon states will be considered in more detail in Chapter 7, but for our 
present purposes all we need is that the most general form consistent with the strong 
interaction symmetries is 


2 


= 


0 (there exists 


j = Ti iqx 
(sla Ebi) = üs [ugla + augl] yue, (5.112) 
where q = pf—p; and gı ,3 are form factors. gı is a slowly varying function, with experimental 
value gı(0) ~ 1.27 from 8 decay, while the induced pseudoscalar g3 has a pole at q? = 0 
from the massless Goldstone boson, as indicated in Figure 5.18. Using (5.106) it is given by 


B) ~ = Gn (5.113) 


where Gr ~ 13.1 is the effective pion-nucleon coupling defined following (3.84) on page 103. 
Using or A$, and evaluating the divergence of (5.112) at q? = 0 one obtains the GT 


relation tG 
0)M, = ==. 

gı (0) Pp V2 
(5.114) should hold in the chiral limit, and experimentally it is satisfied to better than 
2% (Gorringe and Fearing, 2004). Our present purpose, however, is the observation that 
even in the chiral limit one has that M, « fr # 0 is induced by the chiral breaking, and 

that presumably the quarks also acquire dynamical masses Mayn ~ Mp/3. 

The picture is perturbed when non-zero current quark masses m, are turned on in 


(5.114) 
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Loop. For my, = mg = ms, # 0 the axial generators F” are explicitly broken and the 
pseudoscalars acquire small mass-squares Mg x Mq, i.e., they become pseudo-Goldstone 
bosons, analogous to the example in Section 3.3.3. Also, the constituent quark masses are 
shifted to effective values 

Mg = Mayn + Mq. (5.115) 


The pseudoscalar masses may be obtained by taking the matrix element of 
te yi Be i 
o" A, = img qq = —2mT (5.116) 
from (5.100) between the vacuum and one Goldstone boson state. Using (5.106) and (5.109), 


fay? =-ImZ1/? =e, (5.117) 


where the explicit symmetry breaking parameter € has dimensions of mass®. Finally, it is 
convenient to define a renormalized field 7! = 1'/Z'/? so that 


(O|#?|@*(Z)) = dye. (5.118) 


In terms of 7’ (5.116) becomes 
OHA’ = Fn meii, (5.119) 


which is known as the partially conserved axial current (PCAC) relation (Gell-Mann and 
Levy, 1960). It again displays the Goldstone alternative in the limit OV At = 0, and can 
be used as the basis for soft pion theorems, which can be used to calculate amplitudes for 
physical pions at low energy (Adler and Dashen, 1968). 

Breaking the degeneracy of the quark masses leads to breaking of the vector generators 
F". For Mms > my, = mg the vector SU(3) is broken to SU(2) (isospin), while mg # my, 
breaks the isospin symmetry, as described in Section 3.2.3. The diagonal generators F? and 
F®, which are related to electric charge and strong hypercharge by (3.89), are unbroken. 
The ratios of the current masses cannot be determined directly because of confinement and 
because of Mayn. However, they can be inferred from the observed SU (3)z x SU(3) Rr sym- 
metry breaking effects (Weinberg, 1977; Langacker and Pagels, 1979; Gasser and Leutwyler, 
1982). To lowest order in symmetry breaking, 


2 2 2 2 
My _ 2M20 — MI, + Me — Mo 
= 2 2 2 
ei Cr (5.120) 
2 2 2 i 
Ms _ Mko t Mkt — Mit 
= ae 2 2? 
Ma Mko — Mig FMA, 
or ignoring isospin breaking, 
2 
m m 
E xv (5.121) 


m? matm, 
In practice, higher-order corrections are needed, and additional information on isospin 
breaking is obtained from the proton-neutron and K+ — K? mass differences (which also 
have electromagnetic components that must be estimated), p — w mixing, the 7 — 3r and 
p! — pr? decays, and isospin breaking effects in nuclear physics (e.g., Gasser and Leutwyler, 
1982). Typical estimates of the ratios are 


Ms 


~17-22, Z= n04- 0.6. (5.122) 
Mad Maq 
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The larger value of mg compared to m, explains why Mp > Mp, since the electromagnetic 
effect would make the proton heavier (the two contributions are of the same order of magni- 
tude). Similar statements apply to ore = Mo < 0, while iy —m, > 0 is almost entirely 
due to electromagnetism (both m2, and m2, are proportional to my, +ma to leading order). 

The absolute scales are more difficult, since they depend on strong interaction matrix 
elements of the operators in (3.113), and also on the renormalization scheme and scale (see 
the review on Quark Masses in Patrignani, 2016). Most estimates are in the range 


Mau ~2—3 MeV, Ma ~ 4—5 MeV, ms ~ 90 — 100 MeV. (5.123) 


These are actually running masses, evaluated at a scale u ~ 2 GeV. The u and d masses are 
much smaller than other strong interaction scales, such as the nucleon mass or A, i.e., both 
SU(2) and SU (2); x SU(2)r hold at the < few % level. The near exactness of the chiral 
symmetry is most evident from the fact that m2 is so tiny compared to other hadronic 
mass-squares. The success of the vector isospin symmetry is because my, and mg are both 
extremely small even though their ratio is not close to unity, and also because a is small. 
Ms is small compared to most hadronic masses, but not negligible. That is why the SU(3) 
breaking effects are typically of O(25)%. The one counterexample is m7, /mĉ2, which is large 
because of ms >> Mu,a and the special role of the K and 7 as pseudo-Goldstone bosons. 
We also record typical values for the heavy quark masses 


me(me) ~ 1.2 — 1.3 GeV > me” ~ 1.6 — 1.7 GeV 
my(mp) ~ 4.1 — 4.2 GeV > m?” ~ 4.8 — 5.0 GeV (5.124) 
mi(m,) ~ 163.7 £ 0.7 GeV => m?” ~ 173.3 + 0.8 GeV. 


The first values are the running (MS) masses, while the second are the propagator pole 
masses, which correspond most closely to kinematic masses. They are related by 


4a,(m? 

mele ~ mm) h m oe) (5.125) 
3 T 

The relation is actually known through O({a,(m?)/z]°)(Melnikov and Ritbergen, 2000). 

The series converges very slowly, especially for mp and me, and it is best to work in terms 

of the running masses to minimize the associated uncertainties. 


5.8.3 The Axial U(1) Problem 


One aspect of QCD and other quark-gluon models that is not present in the general (3*, 3)+ 
(3, 3*) model is that the symmetry is U(3); x U(3)r rather than just SU (3)z x SU(3)R x 
U(1)z as the quark masses go to zero. Assuming that the dynamics of the extra axial U(1) 
generator F°5 = B4 are similar to those of the F’°’, one might expect: that it is spontaneously 
broken, F°°|0) 4 0, so that there would be a ninth pseudo-Goldstone boson in addition to 
the m’s, K’s, and 7. Although the 7/(958) has the appropriate quantum numbers, it turns 
out that either Mm}, My, or both are just too large to be consistent with the 7 and K masses, 
independent of the assumptions on the spontaneous breaking. Even if the F°’ symmetry is 
not spontaneously broken, the axial U(1) from U(2)z x U(2)r will be, with m, predicted 
to be too light when the quark masses are turned on. To be more explicit, using standard 
current algebra techniques and ignoring SU(2) breaking for simplicity (i.e., m3 = 0), one 
can show (Weinberg, 1975) that the two mass-squared eigenvalues of the mixed n- n’ system 


are 5 
3m 4 5 1 
Pee mg [1+ 5.126 

14222’ è +z) (Ee 
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up to corrections smaller by O(m2/m?2,), where z = fo/fx, with fo the decay constant for 
F°” defined analogously to fr in (5.106). For z = 1 the two mass eigenvalues are predicted 
to be My = mz ~ 140 MeV and my = V2mK ~ 700 MeV, respectively, compared with the 
observed 7 and 7’ masses of 548 and 958 MeV. For z — 0 (no spontaneous F° breaking) one 
state becomes very heavy (not a Goldstone boson), but the lighter eigenvalue is 3m, ~ 243 
MeV. The latter value is the upper limit on the lighter mass. This conflict with observation 
was known as the U(1),4 problem or the 7 problem. There were also difficulties associated 
with the 7 decay. 

To resolve the U(1)4 problem there needs to be a contribution to the explicit breaking 
that is independent of the quark masses. The SU(3)-singlet n’ is distinguished from the 
octet states 7, K, and 7 by the third diagram in Figure 5.18. The two-gluon intermediate 
state is a singlet under flavor SU(3) and can therefore couple to the 7 but not to the octet 
states in the SU(3) limit. This is not by itself sufficient to solve the problem, however. 
Another necessary ingredient is the triangle anomaly in the By-gluon-gluon vertex, which 
violates the symmetry and leads to a divergence (Adler, 1969; Bell and Jackiw, 1969) 


Nq 2 
o AC = 2; 5 m, qr +n Js Giver Gi 


i672 ae (5.127) 


r=1 
where AS, Sy GrYn¥ dr. We have included the explicit breaking from the quark masses, 
Nq = 2 or 3 is the number of flavors being considered, and G is the gluon dual field strength 
tensor defined in (5.7). The GG term (which incidentally has the same structure as the 
$gcp term in (5.2)) appears to do the job, but there is a further complication in that it 
can be written as a four-divergence, 


ip ñi iv {mips , Js j as N= 
G 5 Gig E o" [éwaoG (G p F 3 fig GPG" )| = OM K. (5.128) 
The current i 
40 — 40 Js 
A; = A; — ET K, (5.129) 


is therefore conserved for m, — 0, apparently reintroducing the problem. However, K, 
is not gauge invariant, and it was shown (’t Hooft, 1976b, 1986; Schäfer and Shuryak, 
1998; Vicari and Panagopoulos, 2009) that there are non-perturbative vacuum tunneling 
gauge field configurations (instantons) that fall off sufficiently slowly that they contribute 
to surface terms when the divergence is integrated over space-time. The U(1)4 anomaly 
and instantons therefore generate an 7 mass that survives when the quark masses vanish, 
solving the U(1),4 problem and evading the Nambu-Goldstone theorem. 

There is another consequence of these considerations: since the current A® is formally 
conserved in the chiral limit, one can use the associated charge F® to generate phase 
rotations on the quark fields. For example, under the transformation U (8) = exp(—i@F°°) 
the quarks transform as 


q > U(8)qU~*(8) = eb q < (L,R `> et ar R (5.130) 


(cf. (3.28) on page 96). However, it can be shown (Callan et al., 1976; Jackiw and Rebbi, 
1976) that because of the gauge noninvariance of the K component, the @gcp term in 
(5.2) is not chiral invariant, and that 0gcp is shifted to 0g¢p — 2n,3 under the same 
transformation. An equivalent formulation is that there are a continua of possible vacua of 
QCD, labeled by 6gcp. However, the vacua are not invariant, 


U(B)|@gen) = |@gop — 2ngf). (5.131) 


The implications for the strong CP problem will be mentioned in Section 10.1. 
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The Pseudoscalar and Vector Nonets 


In Section 3.2.3 we described the Gell-Mann-Okubo formula for the nucleon masses in the 
presence of SU(3) breaking. The GMO formula (3.105) on page 108 followed from the 
assumption that the explicit breaking term in £ transformed as the 8” component of an 
SU(3) octet, as in the quark model. Similar considerations apply to the nonets of mesons, 
such as the pseudoscalar (7[138], K [496], 7[548], 7/[958]) and the vector (p[770], A*[892], 
¢[1020], w[782]). In each case, the first state refers to the isospin triplet, the second to the 
two isospin doublets, and the last two to the isospin singlets, which may be mixtures of the 
isosinglet in the octet and an SU(3) singlet. The numbers in brackets are the masses in MeV. 
(We ignore isospin breaking.) The vectors are interpreted as the lowest lying L = 0,5 = 1 
qq states in the SU(3) quark model. The pseudoscalars have the dual (and sometimes 
conflicting) interpretation as pseudo-Goldstone bosons, or as the L = 0,.S = 0 states in the 
quark model. 

The derivation of the mass formula differs in several ways from that in (3.105). First, as 
is clear from chiral perturbation theory or the form of the free Lagrangian, the analogs of 
(3.102) and (3.104) should apply to the meson mass-squares. Second, the meson octets are 
self-conjugate, so the two isodoublets are related and degenerate by charge conjugation. The 
apparent loss of a parameter is compensated by the fact that Bose symmetry implies that 
only the symmetric dig; term in (3.102) is present. Finally, the matrix elements in (3.102) 
only involve the states within the original octet. In principle, SU(3) breaking can mix the 
octet states with single particle states from outside the octet, or even with multiparticle 
states. That is not a significant effect for the baryons, but it is for the meson nonets, which 
involve an SU(3)-singlet that can mix significantly. The result is that the analog of (3.105) 
for the pseudoscalar and vector octets is 


Ami, = 3mz, + me, Amigs = 3m‘, +M, (5.132) 


where mz, and m4, are, respectively, mbgg and mi/gg, the 88 entries of the pseudoscalar 
and vector octet mass-squared matrices. However, the SU(3) breaking can also induce 
Moog = Mgo Or Mi-og = Mig Mixing terms between the octet and singlet, so the n — n’ 


and ¢ — w systems are really described by 2 x 2 mass matrices?” such as 


1 2 2 
-L = 5 (18 mo ) us n) ( be ) l (5.133) 


2 
MPos MPoo no 


where moo is the singlet mass-square in the absence of mixing. 7g and no are the original 
octet and singlet states, given in the quark model by 


1 = 1 - 
= — (uŭ + dd — 2ss) , = — (uŭ + dd + s5) . 5.134 
w= h m= I ) (5.134) 
Within this simple two-state approximation, the mass eigenstates are 
n = ns cos Op — No sin Op, n! = ng sin Op + no cos Op, (5.135) 


where Op is the pseudoscalar mixing angle. One finds 


Mp, = cos” Opm + sin? Opm. (5.136) 


?2Tsospin-breaking (e3) effects would extend the discussion to include 7? — n — 1’ and p? — ¢—w mixing. 
Also, mixing with excited octet or singlet gg states or with gluonium is possible. We will ignore these 
complications. 
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From the GMO formula in (5.132) as well as confirming (and sign) information from decays 
involving the 7 and 7 one obtains (Patrignani, 2016) @p ~ —11.5°. The rather small 
value for @p is easily understood in terms of our discussion of the U(1)4 problem. In the 
chiral limit the octet mass-squares all go to zero, while the Mp ~ my induced by the 
anomaly/instanton effects is large compared to typical SU(3) mass splittings of < few 
hundred MeV. Therefore, one expects |mog|, mbes K Mboo and relatively little mixing. 
Similar formulas apply to ọ — w mixing. In this case, however, the mixing angle 0y 
defined by 
ob = dg cos Oy — Qo sin by, w= dgsindy + ġo cosy, (5.137) 


turns out to be large, Oy ~ 38.7°. This is very close to the ideal or magic mixing angle 
Or = 35.3° (tan 0r = 1/2) for which ¢ would be purely s5, 
= De 7 2 2 2,1l 2 

or —ss, wn we + dd), Mg, = 3M + 3 Mo (5.138) 
This, and similar results for other nonets with sufficient data, is again easy to understand. 
The vector mesons are well described by the non-relativistic quark model. In the chiral 
limit there is no major distinction between the octet and singlet, and they should have 
comparable masses, perhaps dominated by 2Mayn. This is not exact. For example, the ġo 
could acquire additional mass from gluon intermediate states, analogous to the two-gluon 
diagram in Figure 5.18. However, by charge conjugation and gauge invariance at least three 
gluons are required, so the contribution is suppressed by O(a). In any case, as long as the 
the unperturbed singlet and octet mass-squared difference is small compared to the SU(3)- 
breaking effects one expects the heavier mass eigenstate to be mainly s5 since ms > Mu,a- 
(The precise connection to the 2 x 2 mass matrix in (5.133) is explored in Problem 5.15.) 
This ideal mixing picture is further supported by the expectation 


mi, — Migs ~ Migs — m? (5.139) 


p w) p? 


which is obvious from the quark interpretation and agrees reasonably well with observations. 
For the vector mesons, unlike the pseudoscalars, the approximate picture, including the near 
ideal mixing, still holds if one assumes the perturbations are linear in the masses. 


5.8.4 The Linear o Model 


Some of the formal arguments presented above are more transparent when presented in 
terms of an effective theory, the linear o model, involving the composite fields defined in 
(5.102). We will illustrate this for the SU (2)z x SU (2)r limit, with fields 

. t 1 1 _ 

T = -ig 4 o = 549 = 5 (uu + dd), (5.140) 
where į = 1,2,3 and the normalization of ø is chosen for convenience. (Neither the ø nor 
the 7 field in the next Section corresponds closely to any physical meson.) We assume that 
the strong QCD dynamics generates the effective interactions in 

_ 1 1 7 

Lo = pi pl + =(Oyo)? + = (Out)? — grb [0 — iT FY] 4 
2 2 
9 X (5.141) 
-E (o +z?) — Â (0? +7?) + ao, 
2 4 
where Y% = (Yp Yn)” is the nucleon doublet. Lo is SU (2)z x SU (2)r invariant except for the 


linear ao term, which mimics the current quark masses and explicitly breaks the symmetry. 
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W.l.o.g., we can take a > 0. We see explicitly from (5.141) that if SU (2)z x SU(2)r were 
unbroken (a = 0 and no spontaneous breaking) the nucleons would be massless and the 
pions would have a degenerate parity-even partner (the a). The phase in which the axial 
generators are spontaneously broken is more realistic. 

To see the SU (2)z x SU (2)r invariance of Lo explicitly, define a 2 x 2 matrix 


ene {a= int 
Mac a ~ V2 gahe (5.142) 
UT V2 


which is reminiscent of our treatment of the adjoint representation of a simple group in 
Section 3.2. Then 


Lo = bri dvr + bri yr + 5m [om (M)| (5.143) 
2 
— V29n (YrMYr + oi M' ye) — ET (MM") — [Tr (MMt)]° Pag. 


WL,R transform the same way as qLŁ,rR, i.e., as (2,1) and (1,2), 


SU2Q),: dae Fd, drodr 
ve (5.144) 
SU(2)r: oi v1, Yr >e = op. 
It can be shown (Problem 5.16) that M transforms as (2*, 2), i.e., 
M > élei Mets, (5.145) 


so that Lo is invariant except for the ac term. The infinitestimal form of (5.145) is 


RIOR-Ax#+Bo, o>o-—-ß-zF, (5.146) 
where 2 - 5 3 
pa eo a= Reh (5.147) 


are associated, respectively, with the axial and vector generators. The meson terms in (5.141) 
and (5.143) are invariant under SO(4) for a = 0, with (ø, 7) transforming as a vector. This 
is not anew symmetry, but just reflects the equivalence SO(4) ~ SU(2) x SU(2). Of course, 
the vector SU (2) symmetry is preserved even for a 4 0. 

The minimization of the scalar potential 


(0? + #?)* —a0 (5.148) 


is straightforward, i.e., 


dV 
a [M+X(0? +#7)|o-a=0 
T (5.149) 
ma [p +A (o? + 7 *)] =D 
at the minimum. In the chiral limit, a = 0, one has 
' À >o 
O|7"|0) = 0, O|o|0) = v = 2 5.150 
(0|7"|0) (0|o|0) =v =F, <0, (5.150) 
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where we assume that any nonzero VEV is in the ø direction to ensure a smooth a + 0 
limit (Dashen, 1971). For a 4 0, one has that (0|o|0) = v Æ 0 and (0|7*|0) = 0. The precise 
value of v can be obtained by solving the cubic equation in (5.149). However, we have no 
need for an explicit expression. For both a = 0 and a > 0 one can expand V(v + 0’,7) to 
obtain 

m2 = pw? + dv? > mv =a. (5.151) 
Again, this displays the Goldstone alternative, i.e., either m? — 0 or v > 0 in the symmetry 
limit a — 0. One also has that a dynamical nucleon mass 


My = Gr (5.152) 


is generated for v Æ 0. The o’ mass is \/—2? for a = 0. However, the o’ is not a realistic 
aspect of the model, so the exact value is not important. From (3.46) the axial current is 


Ai = Burs + (Quo) Tİ — (Ayr) 0. (5.153) 


From (5.106) this suggests that the pion decay constant is related to v by (5.110), at least 
up to renormalization effects. Then, from the equations of motion or from (3.50) on page 99 
one recovers the PCAC relation 

— S 

OM AY = art = vm r = m2r. 5.154 

jt = 5 (5.154) 
The phase diagram for the model as a function of u? and a is further discussed in (Lee, 
1972). 


5.8.5 The Nonlinear o Model 


The PCAC relation leads to various soft pion theorems for the behavior of amplitudes 
involving low energy pions (Adler, 1965b; Adler and Dashen, 1968; D’Alfaro et al., 1973). 
However, these can be made more explicit, and the relation of the pseudo-Goldstone boson 
pion fields to the SSB more obvious, by redefining the fields (Weinberg, 1967b). There are 
various possible reparametrizations (see Donoghue et al., 2014, for a very nice discussion). 
One closely related to the discussion in Section 3.3.5 is to define fields 7 and E such that 


M =u(é)¥? (=) IU(È)"2 = (=) IU(È), (5.155) 


where € is interpreted as the pion field and 
U(E) =e 7/", (5.156) 
Then, E disappears from the meson self-interaction terms, yielding 


ai Doi 1 2, +n? uir 
Lo = Wri Phr + Yri dvr + 5 Oun) + —y Tr [(0"U)' (0U) 
TEA oe X (5.157) 
_ KETNET T rri oB z A 4 
V2 ( 7 ) Gnd n + baw wr) 5 | +n) qu +n) ; 
where we have taken a = 0. However, E reemerges as derivative interactions from the |0,,U|? 
term, which can be expanded as a power series in €/v to all orders. The leading terms are 


V+ Way anian] =i (€) + 25 G oE) -IEP (0,ë) Í . (5.158) 
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Similarly, one can eliminate the (apparently) non-derivative pion-nucleon interaction by 
defining new fermion fields 


=U, bp =U bp, (5.159) 
so that the nucleon terms in Lo become 


Ly = Fri PUY, + Pri Dlg — V2gn (X42) (Pav, + Five) 
J2 


+ ip U? (GUM?) W, +iPRU T? (BUY?) Yh 


(5.160) 


U'/? can again be expanded in a power series in ¿t /v. The leading pion-nucleon interactions 
are (dropping the primes) 


Leg =- (OE) b- aT- (E x 0,2) v. (5.161) 
By comparing the matrix elements between physical nucleon states of the terms linear in 
T and € in (5.141) and (5.161) one immediately recovers the Goldberger-Treiman relation 
(5.114). 

The ø model, especially as formulated in (5.157), is useful for computing processes 
involving low energy pions. The ņ field decouples for large mass and can be ignored, yielding 
the nonlinear o model. This and equivalent representations can be used as a basis for chiral 
perturbation theory (Pich, 1995; Bernard, 2008), which summarizes the implications of 
spontaneously broken chiral symmetry and the associated soft pion theorems for low energy 


hadronic processes. 


5.9 OTHER TOPICS 


QCD and the strong interactions are an enormous subject, and we have only been able to 
scratch the surface here. An incomplete list of other topics that were omitted or only touched 
on includes the connection to nuclear forces (Epelbaum et al., 2009; Machleidt and Entem, 
2011); modern highly-efficient methods for the calculation of scattering amplitudes (e.g., 
Parke and Taylor, 1986; Witten, 2004; Elvang and Huang, 2015; Dixon, 2014); the connec- 
tion with string theory (e.g., Domokos et al., 2009); the application of ideas derived from or 
motivated by string theory, such as the AdS/CFT correspondence (Maldacena, 1998; Kle- 
banov, 2000); and computer-based computational methods for hadronic processes. Some 
of the packages used for the calculation of matrix elements, event generators, and parton 
distribution functions are listed in the website section of the bibliography. 


The Hadron Spectrum 


The spectra, decays, and other properties of ordinary gq mesons and qqq baryons and their 
interpretation in QCD are discussed in, e.g., (De Rujula et al., 1975; Godfrey and Isgur, 
1985; Rosner, 2007; Eichten et al., 2008; Klempt and Richard, 2010; Patrignani, 2016). 

The qq and qqq states in (5.1) are not the only possible color-singlet hadrons (Jaffe, 
2005). For example, there could exist gluonium or glueballs (bound states of gluons), which 
are hard to observe because of complications from mixing with ordinary q@ states (Klempt 
and Zaitsev, 2007; Ochs, 2013). 

Exotic quark combinations are also possible, such as tetraquark (qqqq) mesons and pen- 
taquark (qqqqq) baryons. In recent years a number of “XYZ” tetraquark states involving 
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cé or bb, but which are not consistent with ordinary charmonium or bottomonium, have 
been reported (Olsen, 2015). These could be molecular, i.e., consisting of pairs of loosely 
bound ordinary mesons, i.e., (qg)(qq), with each (qq) a color singlet. They could instead 
be diquark-antidiquark bound states, with the diquark (qq) in a 3* (or a 6) of color, and 
the converse for the (qq) antidiquark. They could also be admixtures of these and could 
contain, e.g., a c€ component. The LHCb collaboration(Aaij et al., 2015) has reported two 
uudcé pentaquark states in the A? + J/w K~p decay, appearing as resonances in the J/1) p 
channel. Possible interpretations again include molecular bound states of color singlets, such 
as (udc)(ué), or diquark-diquark-antiquark combinations like (ud)(uc)é. The experimental 
and theoretical situation for tetraquarks and pentaquarks is reviewed in (Chen et al., 2016). 

Other possible exotic hadrons include dibaryons (6q, such as wuddss) and hybrids (qqGG). 
The deuteron can be considered a molecular dibaryon, but no other such exotics have been 
established. 


High Temperature and Density 


Techniques such as lattice field theory and the AdS/CFT correspondence have been very 
useful in extending the study of the strong interactions into new domains, such as high 
temperature and/or density (Das, 1997; Kogut and Stephanov, 2004; DeGrand and DeTar, 
2006; DeTar and Heller, 2009; Ohnishi, 2012; Satz, 2012). Under such conditions there may 
be a phase transition to a plasma consisting of quarks and gluons, such as presumably 
existed above some critical temperature To of order A in the early universe (Kolb and 
Turner, 1990). The chiral symmetry, which is spontaneously broken at T = 0, may also be 
restored at high T, perhaps at or close to Te. These extreme conditions are also probed in 
heavy ion collisions in the RHIC collider at Brookhaven and the ALICE experiment at the 
LHC (Armesto et al., 2008; Florkowski, 2014), and may be relevant in extreme astrophysical 
environments such as the cores of neutron stars (Alford et al., 2008; Brambilla et al., 2014). 


5.10 PROBLEMS 


5.1 The Han-Nambu model (Nambu and Han, 1974) was an alternative to the color-quark 
model involving integer-charged quarks. Like QCD, it assumed that the u, d, and s quarks 
each transformed as triplets under the ordinary global SU(3) flavor symmetry, and also as 
triplets under a second SU (3)e, which we will refer to here as color for simplicity. The SU (3)e 
might or might not be gauged. (Nambu and Han actually assumed a (3,3*) assignment 
rather than (3,3).) Like QCD, it was assumed that the ordinary baryons and mesons were 
SU (3) singlets. However, they allowed a more general electric charge generator than (3.89), 
i.e., Q = T3 +T8/V3 +aT3 + bT8, where T’ (TŻ) are the SU(3) (SU(3)-) generators, with 
b and c chosen so that the electric charges are integer rather than fractional, i.e., 


RGB R G B 

2 2 2 

QHN = d —1 0 0 , VS. Qecp = d -4 -4 -4 
as W210: 0 re ee ee 


(a) Show that the ordinary baryons and mesons have the correct electric charges. 

(b) Find b and c. 

(c) How are the tests (R, Drell-Yan, 7° — 2y) for the number of colors in Section 5.2 
modified? 
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5.2 Calculate the color- and spin-averaged differential cross section da/dcos 6 in the CM 
for cd > cd in QCD at tree level, where c and d are the charm and down quarks, respectively. 
Keep me but neglect ma. Express your result in terms of gs, 5, Me, and cos 6. 

Hint: verify that your result agrees with Table 5.2 in the appropriate limit. 


5.3 Suppose that the QCD gauge symmetry is extended to the chiral color group SU(3)r x 
SU(3)pr, where the two factors have gauge couplings grr and gauge bosons Gi, R i= 1-8, 
and the chiral quarks qz and qpr transform as (3,1) and (1,3), respectively. Thus, the quark 
gauge interactions are 

La = -9L L Gi: Lar — grar ER Lar, 
where L' = \*/2 acts on the color indices and the flavor indices are suppressed. (Do not 
confuse this with the chiral flavor symmetries.) Introduce in addition a complex scalar field 
Pab, a,b = 1---3, transforming as (3,3*). ¢ can be represented as a 3 x 3 matrix, as in 
Problem 3.26. 
(a) Write the renormalizable gauge, Yukawa, and potential terms in £ involving ¢. 
(b) Suppose the potential is minimized for (¢) = vg I, where I is the 3 x 3 identity matrix. 
Show that the gauge symmetry is broken to the diagonal (vector) SU(3), i.e., to QCD, and 
show how the QCD gauge coupling, gauge bosons, and generators are related to those of 
(c) Find the mass of the other gauge bosons (the aziglwons) and their coupling to the quarks. 
Show that the interaction is purely axial in the special case gz = gr. 
(d) Show that the original SU(3), x SU(3)r theory has triangle anomalies. Suggest a 
simple way to avoid the anomalies, and show how the interactions are modified. (Ignore the 
electroweak gauge symmetries.) 


5.4 Calculate the spin and color-averaged CM cross sections for qg + QQ and qq > qoqo, 
where Q is a heavy quark with mass mg and qo is a spin-0 color triplet with mass mo. 
Neglect the mass of the initial quarks. 


5.5 Suppose the heavy go in Problem 5.4 were stable. Then, goGo pairs produced in the early 
universe could annihilate into quarks and gluons. The relic density of scalar quarks that do 
not annihilate depends on the thermal average of G(qoGo > ¢@)8rei and F(qoGo > GG) Bret, 
where Brel = lba — Bal (Kolb and Turner, 1990). Calculate these quantities in the CM (not 
the thermal average), assuming that mo is much heavier even than the top quark. Show 
that near threshold, s = 4m, the rate into quarks is P-wave suppressed (i.e, is proportional 
to 6”, where £ is the qo velocity), while the rate into gluons is not. 


5.6 Use a computer algebra program to calculate |M|?/g? for: 

a) GrGr > GsGs, r #8, keeping the masses m, and ms. 

b) GG —> qq keeping the quark mass m. 

c) GG > GG (i.e., verify the result in Table 5.2). 

d) Plot the total cross sections in nb (~ 2.6 TeV”) for qq > tt and GG —> tt vs ys for 
2M; < V/s < 4 TeV. Take a, ~ 0.12 and m; ~ 173 GeV. Ignore mg. 


( 
( 
( 
( 


5.7 The two-loop RGE for the QCD coupling for ng quark flavors is 


bza 


s? 


where 
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(The one-loop coefficient bz is —47b, where b is defined in (5.34), because the equation is 
written for a, rather than for g?.) Show that the solution can be written for large Q? as 


2 
1 bz InIn(2) 1 
as(Q’) = 1 +0|—— }. 
ow te | b3 nS n? 2 


5.8 In QCD the strong fine structure constant a,(Q?) is predicted to run according to 
the renormalization group equation in (5.37) (obtained from (5.34)). The measured value 
at Q = Mz ~ 91.2 GeV is as(M2) ~ 0.12. 

(a) Calculate the predicted value of a,(M?), where Mp ~ 1.2 x 101° GeV is the Planck 
scale. Assume nq = 6 and neglect all fermion masses, including my. 

(b) In supersymmetric QCD one adds to the particle content 8 gluinos, which are Weyl 
fermions transforming according to the adjoint representation of SU(3), and 2n, scalar 
quarks (or squarks), each of which is a complex spin-0 particle transforming according to 
the fundamental representation. Calculate how (5.37) is modified in supersymmetric QCD 
and the new predicted value at Mp (still assuming the value 0.12 at Mz). Neglect the gluino 
and squark masses. 


5.9 Show that the perturbative (short-distance) part of the non-relativistic potential 
(defined in Problem 2.33) between a quark and antiquark in a color-singlet state is 
V(r) = —42ü/r) 


r 


5.10 The well-established charmonium (yw) and bottomonium (T) L = 0 spin-triplet states 
are respectively 7(nS),n = 1,2 and T(nS),n = 1...4, where (1S) = J/w, with masses 
M(ns) = (3.097, 3.686) GeV and My(ns) = (9.490, 10.023, 10.355, 10.579) GeV. Show that 
these masses can be reproduced to S 1% in the nonrelativistic model using the potential 
in (5.89). Allow me and mẹ to vary in the vicinity of myig)/2 and my(1s)/2, but fix 
Qs = 0.24 for charmonium, a, = 0.18 for bottomonium, and « = 0.2 GeV? for both. (More 
complete studies vary « and a,, include relativistic and spin-dependent corrections, and 
include information from decay rates, as in Problem 5.11). 

Hint: solve the radial Schrödinger equation numerically, and iterate the values of me and 
mp until the mass of the 1S state is reproduced. 


5.11 The J/1(3100) is a narrow charmonium (cc) resonance with orbital angular momen- 
tum L = 0 and spin-1, which was predicted (Appelquist and Politzer, 1975) before the 
observation of charm. Assuming that the J/y) can be considered a non-relativistic bound 
state, show that the lifetime for J/y — ete~ by one photon is 


4 0)|? 
Tw ate e? p Or | ; 


me 


where we have taken mj/,4, ~ 2m, and neglected the electron mass. Ne = 3 is the number 
of colors, ee = 2/3 is the electric charge of the c quark, and w(0) is the wave function at 
the origin. 


5.12 Derive the QCD equations of motion for q and g for three quark flavors using £4 in 
(5.90), and use them to prove (5.100). 
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5.13 The pion-nucleon o term ory is defined as 


_ m 
— 2Mn 


where mî = (Mu + ma)/2. It can be thought of as the contribution of the chiral SU(2)- 
breaking quark masses to the nucleon mass, and it can be measured independently in nN 
scattering (e.g., Cheng and Li, 1984). One way to estimate ory (Cheng, 1976) is to write 
the matrix element as 


OnN 


(N|ūu + dd|N), 


(N|ūu + dd — 28s|N) + 2(N]|5s|N), 


and then invoke the OZI rule (based on simple quark model ideas) to neglect the second 
term. The first matrix element can be related to the shift in the nucleon mass due to 
SU (3)-breaking, as in (3.103). Show that 

3 M 


OnN™~ = = (Ma + My — 2Mn) ~ 25 MeV 
2m, —™m 


in the OZI approximation. Note that estimates from aN scattering and from lattice 
QCD (Kronfeld, 2012) are typically higher than this value, suggesting that the OZI ap- 
proximation is not very good in this case. 


5.14 Derive (5.128). 


5.15 (a) Derive the meson GMO formula in (5.132). 

(b) Express the elements of the mass matrix in (5.133) in terms of the mixing angle and 
eigenvalues. 

(c) Show how mg, mo, and mg must be related to obtain ideal mixing. 

(d) Show that ideal mixing and the assumption that the octet and singlet are degenerate 
in the SU(3) limit lead to (5.139). 


5.16 Show that (5.103), (5.145), and (5.146) are all equivalent for infinitestimal Bg = 
—ßr = b. (The transformation of M under @ follows from the ordinary SU (2) considerations 
in Chapter 3.) 


5.17 (a) Consider the linear ø model in Section 5.8.4. It is claimed that the most general 
SU(2), x SU(2)r invariant renormalizable Lagrangian density is given by (5.143) with 
a = 0. Why is there no need for a term 


Tr (MİM MiM)? 


(b) Suppose we have two sets of four real fields Mı = (o1 + ifi -7)/V2 and Mz = (a2 + 
ifa - 7) /./2, which transform as 


M; => eibrF/2 Vf e-iBn 7/2, 


Write the renormalizable SU(2), x SU(2)r invariant interactions and mass terms that 
involve both Mı and Mg. Include any quadratic terms that mix the two. 

(c) Find quadratic terms that, when added to the Lagrangian, would break the symmetry 
to the diagonal SU(2) group with generators F° = Fi + Fh. 

(d) Find quadratic terms that would break the symmetry from SU (2)z x SU (2)r to SU(2)r. 


CHAPTER 6 


Collider Physics 


DOI: 10.1201/b22175-6 


Most of our knowledge of the elementary particles and their interactions comes from exper- 
iments performed at accelerators and colliders. In fixed target experiments primary particle 
beams (e.g., e~, p, ion) from an accelerator collide with a fixed macroscopic target. They 
have the advantage of high relative fluxes because of the target density. They can also 
generate secondary beams of particles or antiparticles produced directly in the primary 
collision, by subsequent decays, or by radiation. These may be unstable or neutral, and 
include mesons, hyperons, e*, u*, v, D, and y. Colliders have the complementary advan- 
tage of allowing higher center of mass energies. For example, in the CERN and SLAC 
ete experiments at the Z-pole each beam had an energy of Mz/2 ~ 45 GeV, while the 
production of a Z utilizing an e+ beam scattering from atomic electrons would require 
E+ ~ M3 /2m. ~ 8 x 10° GeV! 

The history and machine physics of accelerators and colliders is extensively described 
in (Panofsky and Breidenbach, 1999; Sessler and Wilson, 2014; Wiedemann, 2015; Patrig- 
nani, 2016), and particle detectors in (Kleinknecht, 1998; Green, 2000; Grupen and Shwartz, 
2008). For introductions, see, e.g., (Mann, 2010; Tully, 2011). 

Here we will mainly be concerned with the particle physics implications of recent high 
energy e+e, etp, pp, and pp colliders.! e+e~ results, especially the Z-pole experiments 
at LEP and the SLC, will be discussed in Chapter 8. The e*p collider HERA at DESY 
extended neutral current deep inelastic scattering (e~p — e+X by y and Z exchange) into 
new kinematic regions (Figure 6.1), and also studied the charged current (ep > v(¥)X by 
W exchange) case. 

This chapter considers physics at the Tevatron (pp at \/s ~ 2 TeV) and LHC (pp with ys 
up to 13-14 TeV), especially the short-distance processes that can be viewed as scatterings 
of the constituent quarks and gluons. These are essential for testing QCD and determining 
Qs; probing other aspects of the standard model in the production of electroweak gauge 
bosons, heavy quarks, and the Higgs boson; and for estimating both the signal and the 
background in searches for new physics beyond the standard model (BSM). 


1There are a number of proposals for future higher-energy e+e~ linear or circular colliders to study the 
Higgs couplings, search for new physics, and perhaps redo the Z-pole measurements at higher precision. 
The circular collider proposals include ys = O(100 TeV) pp collisions at later stages. There are even more 
exotic possibilities, such as a +p collider or colliders involving one or two high-energy photon beams 
produced by backscattering laser light from high-energy electrons. 
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Figure 6.1 Kinematic regions probed by deep-inelastic, pp, and pp scattering experi- 
ments. Plot courtesy of the Particle Data Group (Patrignani, 2016). 


6.1__BASIC CONCEPTS 


6.1.1 The Cross Section and Luminosity 


The cross section formula (2.51) on page 16 is useful theoretically, but cross sections are 
actually measured by scattering two beams of particles in a collider or by scattering a beam 
from a fixed target. In both cases, the reaction rate is o£, where £, the instantaneous 
luminosity, is the product of the number of incident particles per unit time and the number 
of scatterers (in the target or second beam) per unit area (Patrignani, 2016). For example, 
£ = (fN)(nd) for a beam consisting of bunches of N particles impinging with frequency 
f on a fixed target of number density n and thickness d. For a collider in which bunches 
with Nı (N2) particles in the first (second) beam collide head-on, £ = f NıN2/A where f is 
the crossing frequency and A is an effective area that characterizes the transverse size and 
shape of the bunches. An example is given in Problem 6.1. The total number of events is 
aL, where L = f £dt is the total (integrated) luminosity. 

Cross sections have units of area, and are conveniently expressed in units of energy 
for theoretical calculations. For experimental considerations, however, it is conventional to 
work in more practical units based on the barn (b), where 1 b = 10774 cm? = 100 fm? is 
typical of nuclear cross sections. Related units are 


2 


1b = 10? mb = 10* ub = 10° nb = 10" pb = 10" fb = 10" ab, (6.1) 


where the prefixes represent milli, micro, nano, pico, femto, and atto, respectively. The cross 
sections of some typical processes are around 90 nb for ete~ + wtp at ys = 1 GeV (from 
2.232), 140 fb for a 20 GeV neutrino scattering from a proton, and 10° nb for the total pp 
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cross section at LHC energies (e.g., Pancheri and Srivastava, 2017). However, most of the 
pp cross section involves soft (long-distance) processes. The hard (short-distance) events 
we are mainly concerned with have much smaller ø, e.g., of O(1000 nb) for inclusive jet 
production (pp > jet+X) at the LHC, 50 nb for inclusive Z production, and 20-50 pb for 
pp > Higgs +X, as shown in Figure 6.2. 


proton - (anti)proton cross sections 


Giot 


o «(E " > Vs/20) 


Ow 


82 


o,,(E," > 100 GeV) 


o (nb) 
events / sec for £ = 10° cm’s' 


M,=125 car ia 


WJS2012 


Vs (TeV) 


Figure 6.2 pp and pp cross sections for SM processes as a function of energy, with 
LHC energies of 7, 8, and 14 TeV indicated. HE LHC refers to a possible 33 TeV 
high-energy upgrade. The discontinuities are due to the use of pp (pp) at lower 
(higher) energies. Plot courtesy of W.J. Stirling, private communication. 


The corresponding units for £ are, e.g., “inverse barns per second” (b~!/s): 


1 b'/s = 10-3 mb" /s = 107° pb" /s = 107° nb™t}/s 


(6.2) 
= 107!? pb-+/s = 1071 fb™t/s = 10718 ab“ /s. 
A common rule of thumb for estimating the total number of events in a year’s running is to 
multiply o£ by 10” s, since there are close to 7 x 10’ s in a year, but experiments typically 
operate for only about 1/3 of that time. 
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6.1.2 Collider Kinematics 


Here we collect a few definitions and results relevant to kinematics at colliders. For more 
detail, see (Barger and Phillips, 1997; Han, 2005; Patrignani, 2016). 


Single Particle Variables 


In Chapter 2 and in considering decay processes one usually uses spherical coordinates to 
describe the final particle phase space. However, for studying short distance processes in 
QCD one is usually interested in large transverse momenta for the final particles. Moreover, 
in a hadron-hadron collision the CM of the incident hadrons does not in general coincide 
with the CM of the hard scattering subprocess of interest. For these reasons, it is often 
convenient to work in cylindrical coordinates, with the z axis along the beam direction. pr 
and y represent the magnitude of the transverse momentum and the azimuthal angle of a 
produced particle, i.e., 


p“ = (E, Dz, Py, Dz) = (E, pr cos y, pr sin f, pz), (6.3) 


with E = ym? + p?, pr = psin#, and p; = pcos@, where p = |p| and @ is the polar angle. 
The Lorentz-invariant phase space element is 


ap dp: dp. 

p 7 Pedpy-p = Prdpr dp -p = prdpr dp dy (6.4) 
where the last form will be defined below. pr, p, and dp,/£E are all invariant under longi- 
tudinal Lorentz boosts (i.e., boosts along the +z direction). 


p” can be rewritten as 


p” = (mr cosh y, pr cos y, pr sin y, mr sinh y), (6.5) 


mr = 4/m? + pd. (6.6) 


This quantity is sometimes referred to as the transverse mass or transverse energy, but we 
will reserve these terms for different quantitites. The rapidity y is defined as 


where 


1, E+p, E +p; 
=l = lñ 


n = 
2 E- pz MT 


= tanh“! 2. i 
an E (6.7) 


Equation (6.5) follows immediately by taking the cosh and sinh of (6.7). The rapidity is 
especially useful for expressing the effects of longitudinal boosts. From (6.5), (6.7), and the 
expression in Problem 1.4, the momentum of the particle as seen in another Lorentz frame 
moving with velocity 692 is of the same form, except 
y>y—Yo where y= zn 

dy is therefore invariant under longitudinal boosts, with dy = dp,/E from (6.5), leading 
to (6.4). There is no longitudinal momentum in the frame with 8) = p,/E, where p” = 
(mr, pr cos ọ, pr sin y, 0). 

The measurement of y requires a measurement of both p, and E, so it is often easier to 
work with the closely related pseudorapidity 


= tanh! Bp. (6.8) 


1, 1+ cosé 0 
= -l = — Ìn | tan — ; 
T= 3 1—cos0 n (tanz); (ea 
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which is measured directly from the direction of the particle, with the range 0 < 0 < m 
corresponding to +00 > 7 > —oo. Equation (6.9) is equivalent to 


1 
cosh n` 


cos = tanh n, sin? = (6.10) 
Rapidity is identical to pseudorapidity for m = 0 and reduces to it for p >> m except for 
very small angles (9 < m/p). pr, p, and 7 are therefore useful variables for describing the 
momenta of the final particles in a high pr event.” 

The angular separation between two tracks of momentum pı and ps is conveniently 
expressed as the (longitudinal boost-invariant) distance 


AR = \/(Ay)? + (Ag)? = V(y2 — 1)? + (p2 = p1)? (6.11) 


between them in the y-y plane.® (AR is often defined in terms of the distance in the 
n—v plane, i.e., as ,/(An)? + (Ay)?, rather than (6.11).) Jets can be defined operationally 
as consisting of the cone of particles separated from the center of the jet by a distance 
AR smaller than some reference value (e.g., 0.7). (Other jet definitions are mentioned in 
Section 6.2.) 

Events are often displayed on plots that indicate the the total transverse energy 


Er = Esind = E/ coshn (6.12) 


deposited in the electromagnetic and/or hadron calorimeters in cells in the 7 — y (or y— y) 
plane. If the masses of the particles in the jet can be ignored then Er is just the sum of 
their mp. An example of a dijet event from CMS is shown in Figure 6.3. 


E7(Gev) 
300 
200 Jet 1 P;: 585 GeV 
100 y s 


Run : 138919 
Event : 32253996 
Dijet Mass : 2.130 TeV 


‘Jet 2 Py: 557 GeV À 


` 


Figure 6.3 An example of an 7 — ọ plot, showing a CMS dijet event from the 7 TeV 
run of the LHC. Figure courtesy of CERN. 


2 The various components of the ATLAS and CMS detectors at the LHC have coverage up to |n| ~ 2.5—5. 
3For e+e~ the distance can be defined as the angle between two tracks. 


210 The Standard Model and Beyond 


Parton-Parton Scattering 


Consider the process HaHpg — F + X at a hadron collider, where H4,g refer to the two 
hadrons (e.g., pp or pp), F is specific partonic final state, and X are additional unobserved 
particles. In the short distance regime (e.g., large invariant mass § = pł, which typically 
leads to large momenta pr of the final particles), the cross section can be factorized into 
long distance terms that describe the constituent parton distributions in the initial hadrons 
and a short distance term describing the hard scattering of the partons to produce F (Drell 
and Yan, 1971), as illustrated in Figure 6.4. That is, 


o(HaHs + F+X)=)7 J doaden fh Gali (en. oysrlea,en), (6.18) 
ij 


where i and j label the parton types, such as q, g, and G; ff and f? are their parton 
distribution functions, which are the same ones that are relevant to deep inelastic scattering; 
£A,B are their momentum fractions; and up is the factorization scale, which can be taken to 
be, e.g., V8 or pr. 0ij;+F are the parton-parton cross sections, such as gq — qq or GG — qq. 
The u3 evolution of the PDFs and the hadronization of the q and g into F will be considered 
in Section 6.2. 


P 
= oa 
SS a 

> 
q qd ox 
-pny 
qr G q — 
A < 
pt 


Figure6.4 Diagrams for pp > qqX followed by the hadronization of the q and g jets. 
There are additional diagrams corresponding to GG > qq. 


The total momentum of the subprocess ij > F is therefore pr = zapa + gpg, where 
the hadron momenta in the CM are pa.p = (E4,B, +p), with Ea ~ Eg ~ p = |p| in the 
(relevant) high energy regime. The invariant mass-square for the subprocess is 


py = ~ TATBS E TS, (6.14) 


where s = (pa + pg)? ~ 4p°. One can express the individual £z4,g as 


tA = VTe™”, TB = JTe 4, (6.15) 
where i 
TA 
= lh“ d 
w= ae (6.16) 


is the rapidity of the CM of the subsystem F (not to be confused with the single-particle 
rapidity in (6.7)). This interpretation follows from pr = (xa + 2%g,0,0,24 — £g)p and the 
definition (6.5). The velocity of the system in the hadron CM is 


b =tanhy = oe (6.17) 
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ijr should be invariant under longitudinal boosts, i.e., it depends only § and not on x4 
or £p separately. 

It is sometimes useful to trade the variables x43 for rT = &/s and y. In practical 
applications, the cross section expression (6.13) may involve the factor f dz 4dzpO(8 — ŝo), 
where Sp is a physical or experimental threshold in the invariant mass of the system. But 
O(§ — ŝ0) = O(T — To) = O(a 4xB — To), where To = o/s. Thus, 


1 1 1 1 1 —3lnr 
| ava f dx pO(8 — 8) =| ava f dip =| ar f dy. (6.18) 
0 0 To To/LA To $ lnr 


The relation of the kinematic variables is shown in Figure 6.5. 


10% 0.001 0.010 0.100 1 
XA 


Figure 6.5 The fractional invariant mass \/7 = /8/s of a hard subprocess as a 
function of z4 for various rapidities y. The xg — yT plot is obtained by replacing 
y — —y. The rapidity range for a given 7 is from $lnr to -4 lar, which follows 
from the requirement that T < £A,B < 1. 


Transverse Variables and Missing Momenta 


The high pr particles relevant to a short distance process often involve one or more weakly 
interacting neutral particles that are stable or long-lived and therefore do not interact or 
decay in the detector. These could be neutrinos produced in weak decays of other particles, 
or they could be new “beyond the standard model” particles, such as the lightest super- 
symmetric partner in R-parity conserving versions of supersymmetry. In principle one can 
determine the sum of the four momenta of the missing particles by subtracting the total 
four momentum of the observed particles from the initial momentum (,/s,0). In practice 
this is impossible at a hadron collider because many undetected beam fragments exit the 
detector close to the beam direction.* This problem can be evaded by working in terms of 
the transverse components of the momenta, pr. In a high pr process it is a reasonable first 
approximation to ignore the transverse momenta of the scattering partons, and the trans- 
verse momenta of the beam fragments is small. Therefore, the total transverse momentum 


“In the real world there are other complications ignored here, including large numbers of low energy 
particles, resolution and threshold effects, inefficiencies in particle identification, cracks in the detector, etc. 
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of the subprocess can be approximated by zero, and the (missing) transverse momentum of 
the invisible high pr particles is given by 


prs = — X dir, (6.19) 


where the sum is over the observed particles. The magnitude of p7""** is the missing trans- 
verse energy (MET) E#”"s* (or Zr). It is sometimes also useful to consider the scalar sum 
Hr = >; \Pir|, as well as other variables, especially in searches for new physics such as 
supersymmetry. 

As a simple example, consider pp > W~ + X, where WT — e` De and all of the high 
pr particles in X are measured. In that case, one can determine the transverse momentum 
of the unobserved De 

Boer = -Per — X Bir. (6.20) 
ieX 

Since pp,z is not known, one cannot directly reconstruct the mass of the W. However, one 
can determine its transverse mass Myr, defined by 


Miyr = (Mor + Mer)? — (Poer + Der)” ~ 2po,rPer (1 — cos Yer), (6.21) 


where the mp are defined in (6.6). In the last form Yey = ~e— Pz, and we have neglected the 
lepton masses. It is straightforward to show that 0 < Myr < My. From a single event one 
cannot determine Mw. However, from the endpoint and shape of the Myr distribution for 
a large number of events one can obtain an excellent value. Of course, backgrounds and the 
finite W width must be taken into account. For more complicated examples, e.g., involving 
two missing particles, see (Barger and Phillips, 1997; Han, 2005). 


6.1.3 Soft Processes in Hadron-Hadron Scattering 


Most of the events in high-energy pp, pp, and other hadron-hadron reactions are soft, char- 
acterized by low momentum transfers and (typically) low multiplicity. They are associated 
with long distance effects and are best described by phenomenological models such as Regge 
theory. However, they are of great practical relevance in collider experiments, in which the 
particles from the short-distance processes are only a tiny fraction of those that are actu- 
ally observed. At LHC energies around 25% of the scatterings are elastic (pp —> pp). The 
remaining inelastic processes can be characterized as singly-diffractive, doubly diffractive, 
or non-diffractive. In single diffraction one proton remains intact, while the other is gently 
excited into a small number of particles with low transverse momenta and with rapidities 
not too different from the original proton. In double diffraction each proton is gently ex- 
cited. Both types of diffraction (as well as elastic scattering) are therefore characterized by 
rapidity gaps, i.e., regions in the y — @ plane with little or no activity. They can be approx- 
imately described by pomeron exchange.’ Most of the inelastic events are non-diffractive, 
in which the produced particles are distributed in y and ¢ with no large rapidity gaps. 

It is difficult in practice to measure the total number of inelastic events in a completely 
“unbiased” way since many of the diffractive scatterings leave little or no trace in the 
detector. One instead refers to minimum bias events. The definition is somewhat vague, but 
basically refers to all inelastic events observed in the detector with minimal and reasonably 


5 Also possible is central diffractive dissociation, in which both protons remain intact but additional 
particles are produced with y intermediate between those of the protons, i.e., two rapidity gaps. These may 
be thought of as due to the “annnihilation” of two pomerons emitted by the protons. 
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unbiased trigger requirements defined by the experimenters. (Singly diffractive events are 
often excluded.). Most minimum bias events have low multiplicities and low transverse 
momenta. Their detailed characterization is important, however, because they occur in 
the same bunch crossings as the rare hard events, complicating the observations. (Other 
complications include pile-up, i.e., multiple hard interactions involving different protons in 
the same bunch crossing, and underlying events. The latter are associated with the beam 
remnants, i.e., the partons not involved in the hard scattering, and include their additional 
low-pr scatterings (multi-parton interactions) and soft initial state radiation.) 


6.2 HADRON-HADRON SCATTERING AT SHORT DISTANCES 


Many different types of hard scattering processes pp > F + X or pp > F + X can occur 
at hadron colliders. These include pure QCD processes such as F = light quarks or gluons, 
which subsequently hadronize into jets, as well as standard model processes like F = y, Z, 
or W (Drell-Yan), with the W or Z observed through their subsequently decays to leptons 
or jets. Heavy quarks, Higgs bosons, and new physics particles are also possible, as are 
combinations of any of the above. Although the underlying short-distance scattering may 
appear simple, the full description of the event may be quite complicated. Let us examine 
the various parts one by one. 


The Parton Distribution Functions 


The basic formula is given by (6.13), which displays the separation of the cross section into 
short and long distance terms. This separation can be shown to hold to all orders in pertur- 
bation theory, and is expressed in the factorization theorems (Collins and Soper, 1987). The 
PDF's cs Bg A,B; Hp) are the same as those that enter deep inelastic scattering, provided 
that one uses the same factorization prescription. Their dependence on the factorization 
scale up is controlled by the DGLAP equations. These are known to NNLO, as described 
in Section 5.5.4. 
It is sometimes convenient to rewrite (6.13) using (6.18), i.e., 


dé, dlLi; . n ds dLij . 7 
o(HaHg > F+X)= > [Sa  §0i;4F(8) = S/F -g 8 tur (8), (6.22) 
wy ay 


dé dy & ds 
where i 
dLiz _ lA B e dLij al dz oA B Ee 
= = f; - = = — = |: . — 6.23 
ds dy s” (2) x)’ ds sj, r ` (2); a)? een 
with z = yrTe” and T = &/s. We have suppressed up and the labels A and B on dL;,;, 
and used that ijp is independent of the F rapidity. dij = 7 i, which has units of 


length?, is known as the parton luminosity. It expresses the effectiveness of the partons for 
hard processes at § in a way that is independent of F. We have multiplied and divided 
by § because the logarithmic integral f ds/8 is better behaved for numerical computations, 
and because the dimensionless §a(8) has a weaker energy dependence. Plots of d£;;/d8 
(evaluated at u% = 8) and of their ratios at different s (useful for evaluating the relative 
event rates at different collider energies) may be found in (e.g., Quigg, 2011) or at the 
websites in the bibliography. Examples are given in Figure 6.6. 

In general dL;;/d8 dy 4 dLji/d8 dy for i A j, and the ij and ji terms must be included 
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separately® in the sums in (6.13) and (6.22). For the special case A = B the dL/dê dy for ij 
and ji are related by y + —y (Problem 6.3). For example, d£4q/d8 dy will be larger in the 
y > 0 direction for pp because there are more q than g at large x, while dLgq/dê dy will be 
larger for y < 0. The integrated luminosities are equal in this case, i.e., dL;;/d8 = dL; /d8. 

The PDFs for q and q are reversed for p, so that, for example, LiP}, = Flea g, (for the 
same T and s). 


LHC Parton Luminosities (14 TeV) LHC Luminosity Ratio: 14 TeV/8 TeV 


lh 100 
50} 
107 
T 
= 1000.0 2 
Y © 10 
i) or 
= 04 a 
© 
105 
107° 1 


10 50 100 500 1000 5000 


V8 (GeV) 


Figure 6.6 Left: LHC parton luminosities at ys = 14 TeV as a function of V3, 
computed using the MMHT2014 NNLO PDFs (Harland-Lang et al., 2015). Gq 
refers to Gu + Gd, qq to uu + ud + du + dd, and qq to uu + dd. Right: ratios of the 
parton luminosities at 14 and 8 TeV. 


The Hard Scattering 


The hadron-hadron cross sections in (6.13) and (6.22) involve incoherent sums over the 
color-averaged/summed hard partonic cross sections o;;_,7(8). The initial parton spins are 
also averaged (unless one generalizes to polarized hadron beams) and are often summed 
over final spins as well. 

Examples of QCD and standard model parton-level subprocesses include 


qq > qq, qd > 44, qq © GG, Gq > Gq, GG > GG, 


Š 6.24 
qq > W,Z, y> ff, qq > (W, Z,)G, qq > WW, GG > H, d 


where W and Z are the electroweak gauge bosons and H is the Higgs. The leading-order 
amplitude-squared expressions for QCD processes (for massless partons) are listed in Table 
5.2 (one must replace s,t, and u by the invariants 8,t, and û relevant to the subprocess). 
More extensive listings, including mass effects and extensions to beyond the standard model 
processes may be found in, e.g., (Barger and Phillips, 1997; Patrignani, 2016). In practice, 
ijr must be calculated in perturbation theory to the same order as the PDFs. Many 
processes are known to NLO” and some to NNLO. 

The hard inclusive cross sections actually depend not only on the kinematic variables 


6Some authors define s dLij/dê dy as FA@) IP (r/x) + For (r/x) for i A j. We will instead denote 
this sum by s dLij+ji/dê dy. 
The NLO expressions are sometimes approximated from the LO by a multiplicative K factor. 
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of the observed particles in F, but also on the factorization scale up (e.g., through radia- 
tion from an intial parton) and on the renormalization scale u introduced in Section 5.4. 
These scales would not appear in physical observables if one could calculate to all orders in 
perturbation theory, but do enter when one truncates at a finite order, introducing a theo- 
retical uncertainty. One typically chooses central values of wr and u near V5 or a transverse 
momentum pr to minimize some of the logarithmic corrections, and estimates the uncer- 
tainties by independently varying them over some range, e.g., by an overall factor of two, 
with 1/2 < ur/p < 2. Of course, the scale dependence is reduced by going to higher orders. 
Large logarithmic corrections arise if there is a significant difference in the magnitudes of 
relevant kinematic quantities. 

As an example of (6.22), consider a massive color-singlet vector resonance V,, of mass 
My and width Ty coupling to fermion Va as La = ~ga Ypa Y" Pa Vu- It is straightforward to 
show (Problem 2.27) that the spin and color-averaged Drell-Yan cross section for aa — bb 
via V is a 2 

1 120(8/M2 Taal yp 


z- .7(8) = 6.25 
Faa—5(8) C2 é _ M2)? + MT?’ ( ) 
where Ca is the color factor (3 for quarks, 1 for leptons), 
= Cag? My 
Taa = ——— 6.26 
127 ( ) 


is the width for V —> aa, and similarly for I’,;, and we have neglected the fermion masses. 
When Iy is small compared to My and to the energy resolution of the detector it is 
convenient to employ the narrow width approximation in (F.3) on page 523, so that 


Faav (8) > ôl — Mp) Taaie l) > Gaav (8) Bos, (6.27) 


where B,; =I ,;/Tv is the branching ratio into bb. Therefore, 


Ar? dlo as 
o(HaHp >V +X)= m 5 Or Ir +r Or 


La 6.28 
3 dé G28) 


s=m2 M 
ŝ=M23 +V 


The formulae are easily extended to inclusive and/or differential cross sections. For 
example, for a two-body final state one can replace o;;(ê) > f dealt?) ap or f dou) 40, 


where % = cos Ê and dQ = ddd, with 6 and ĝ the polar and azimuthal scattering angles of 
one of the final particles in the hard-scattering CM. The hadronic differential cross section 
for HaHp > F + X for a two-body F is then 


=> dLij do;;( 8 ,2) do = dLij do;,;(8, 2) 


= :2 
dêdy dê ’ dé dé a dé dé G29) 


dê dy dê T dz 


One can rewrite the cross section in terms of the (y-invariant) single-particle transverse 
momentum pr introduced in (6.3), pr = psin@ = psin#@, where p is the momentum in the 
parton frame, e.g., p = i 2 if the masses can be ignored. Then, for example, 


=> dLij PT 1 doi; (8, 2) (6.30) 


ds P J ori d 


where døg;;(8, 2)/dz is evaluated at 2 = +4/1 — (pr /p)? (which may be summed). For fixed 


x = 


216 The Standard Model and Beyond 


§ there is an enhancement (the Jacobian peak) at the maximum value pr/p = 1, but the 
integral over p2 is finite. 

Equation (6.13) has been successfully tested and applied in many processes at the Teva- 
tron and the LHC, including Drell-Yan, heavy quark production, high pr jet production, 
and W+ jets (Campbell et al., 2007; Wobisch et al., 2011; Patrignani, 2016). 


Hadron Fragmentation 


Equation (6.13) or (6.22) can adequately describe the hard scattering process. However, the 
final quarks and gluons eventually fragment (hadronize) into mesons and baryons, many of 
which are clustered in jets. For a totally inclusive measurement of the hard process one can 
equivalently consider either the scattering into the partons or into the final hadrons (quark- 
hadron duality). However, for more detailed studies, such as the angular, pr, or number 
distribution of jets, one must first identify them. One may also want to study the details of 
the jets themselves, e.g., to determine whether they are more likely associated with quarks 
or gluons, or with the decays of heavy standard model (e.g., W, Z, t, H) or BSM particles. 
Adequate modeling of both signals and backgrounds may also require knowledge of the 
distributions of the individual hadrons. 

The hadronization can be parametrized by a fragmentation function D?(E;,/Ei, 1"), 
which describes the probability for parton f; to emit hadron h carrying fraction Ep /E; of the 
parton energy; u is the relevant renormalization or energy scale (Ellis et al., 2003; Buckley 
et al., 2011; Metz and Vossen, 2016; Patrignani, 2016). One can model the fragmentation as a 
two step process. First, the initial partons branch or fragment into a number of lower energy 
ones in a parton shower, which can be described by perturbative DGLAP-like equations. 
The initial hard parton is typically very energetic and far off-shell, with a virtual mass 
(virtuality) comparable to that of the final jet. In each branching, however, the final partons 
have lower energy and lower virtuality than the parent. 

The parton-shower algorithms are efficient at describing branchings that are at relatively 
low angle or low energy with respect to the initial parton. Harder emissions are usually dealt 
with by including them as extra contributions to the hard-scattering process, which can be 
calculated to fixed order, and which can themselves subsequently shower. Of course, care is 
required to merge the parton-shower and hard scattering and to avoid double counting. 

Eventually the partons from the showers are of low enough energy, e.g., O(1 GeV), that 
perturbation theory is no longer valid, and one must resort to a phenomenological long- 
distance model, which can be tuned with experimental data, to describe the final formation 
of hadrons. One popular model involves string fragmentation, in which a classical color 
flux tube is formed between a q and qg, and which can absorb gluons. When enough energy 
is stored in the tube it creates a qq pair and breaks, forming two color singlet objects. 
Another popular model is cluster hadronization. Following the parton branching, including 
non-perturbative branching of the gluons into qq pairs, hadrons emerge from neighboring 
qq or other color-singlet clusters. 


Jet Definitions 


One can roughly think of jets as being in one to one correspondence with the partons 
produced in the hard scattering. In practice, however, it is non-trivial to actually identify 
the jets, especially in a hadron collider environment, because it is not always certain which 


8In practice, there are infrared singularities associated with very soft or collinear emissions that can be 
efficiently dealt with by a resummation technique known as Sudakov form factors. 
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hadrons are associated with a particular jet. This can occur when jets partially overlap, or 
because of confusion due to multiple scatterings in the same bunch crossing (pile-up) or 
from hadrons from the underlying event. In fact, there must be some cross-talk between 
jets or with the underlying event to allow the resulting hadrons to be color neutral. 

There are a number of (imperfect) algorithms for identifying the jets in a hard scatter- 
ing, each of which has its advantantages and disadvantages, and which may be employed 
depending on the details of the analysis. One important criterion is that an algorithm should 
be infrared and collinear safe, i.e., insensitive to the emission of soft or collinear partons. 
The major classes are cone algorithms and sequential recombination algorithms. 

The older cone algorithms (Sterman and Weinberg, 1977), extensively used at the Teva- 
tron, define a jet as those hadrons lying within some distance AR in (6.11) from the jet 
center, where in practice the jet center is defined by some iterative process. Not all cone 
algorithms are infrared and collinear safe, especially at higher orders, so care had to be 
taken in their use. 

Most LHC, ete and ep analyses utilize sequential recombination algorithms, in which 
each pair of high pr particles i and j is characterized by a “distance” dij, and each particle 
also has a distance d;g from the incident beams. Jets are identified in a multistep process: 


Find the minimum of di; and dig for all i, and j. 


If the minimum is for pair dj; then 7 and j are combined into a pseudo-particle. If it 
is a dig then define 7 as a jet and remove it from further consideration. 


Iterate this process until all of the initial particles have been merged into jets. 
The most common sequential recombination algorithms at hadron colliders define 
AR? 


dip = vi, (6.31) 


dij = min(p7 , prs) 


where AR;; is the distance in the y — y or 7 — ọ plane defined in (6.11), n is an integer, 
and R is a conveniently chosen constant. n = +1, 0, and —1 are known, respectively, as the 
kr, Cambridge-Aachen (C/A), and anti-kr algorithms, all of which are infrared /collinear 
safe. The kr algorithm combines the soft particles first, and models in reverse the likely 
sequence in which the particles were emitted in a parton shower, while the C/A algorithm is 
based entirely on the angular separation. However, both lead to irregularly-shaped regions 
in the y—y plane that are awkward to deal with. The anti-kr algorithm merges the hardest 
particles first. Its physical interpretation is less clear, but it leads to more regularly-shaped 
jet boundaries and is most often employed at the LHC. For specific purposes, however, it 
is sometimes useful to utilize other definitions. 

For a much more detailed discussion of these and related issues, see (Moretti et al., 1998; 
Ellis et al., 2008; Salam, 2010b; Sapeta, 2016; Patrignani, 2016). 


Jet Characteristics and Boosted Decays 


Jets can emerge not only from pure QCD processes, but also from the decays of heavy 
standard model particles, i.e., top quarks, which are too short-lived to directly hadronize, 
electroweak gauge bosons (V = W=, Z), or the Higgs boson H. Heavy BSM particles usually 
also lead to jets, either produced directly or through intermediate t,V, or H decays. It is 
therefore important to characterize the jets to determine their likely origin. This is a vast 
subject and we can only mention a few aspects here. 

Jets may be characterized by their substructure, referring to the details of the branchings 
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in the parton shower, and by various jet shape variables, describing properties such as their 
mass, angular distributions from the direction of the jet, and number of subjets. 

Ordinary QCD jets resulting from light quarks and gluons tend to shower asymmetri- 
cally, with most of the energy in each step going to one of the daughters (e.g., because the 
other is a soft gluon). The angles tend to be small, and the virtuality of the harder parton 
decreases gradually. Because of different color and splitting factors gluon jets tend to be 
broader and have more soft radiation than light quark jets.? Bottom quark jets, which are 
extremely common at the LHC, can be efficiently tagged by criteria involving their mass, 
lifetime, and B decay products (e.g., Voutilainen, 2015). 

The energy at the LHC is sufficiently large that the t, WF, Z, and H may be produced 
with pr (or energy) much larger than their mass (O(100 — 200 GeV)). This is true in parts 
of the phase space for standard model production processes, and is even more likely if 
they result from the decays of very heavy (e.g., m > TeV) BSM particles. The leptonic or 
hadronic decay products of such highly boosted particles will be highly collimated. Consider, 
for example, a tt pair, each decaying hadronically, t + bqg'. If the tt pair is produced at 
rest, one will typically observe six jets, distributed more or less isotropically. If they are 
sufficiently boosted, on the other hand, the jets will be collimated into two groups, perhaps 
even appearing to be only two fat jets, i.e., the event shapes will be very different. Similarly, 
for a W~ decaying to y` D, one cannot in general reconstruct the longitudinal momentum 
of the invisible p. However, if the W~ is highly boosted one has the additional constraint 
that the 2, momentum is nearly parallel to that of the u7. 

Highly boosted decays and detailed studies of the substructure of the resulting jets often 
allow methods to identify the decaying particle, increase the signal to background ratio, and 
reduce combinatoric backgrounds. We illustrate by describing a seminal strategy to search 
for the decay of the Higgs boson into bb that inspired a great deal of effort on jet substructure 
(even though the recipe was not followed in detail by the initial ATLAS or CMS analyses). 
For My ~ 125 GeV, bb is expected to be the dominant decay mode, with a branching ratio 
of nearly 60% (Section 8.5). However, the H — bb rate is swamped by QCD and other 
backgrounds at the LHC, even when the Higgs is produced in association with a Z or WS. 
To ameliorate this difficulty it was suggested in (Butterworth et al., 2008) to concentrate 
on highly boosted events in which the V and the H are nearly back to back, each with 
pr = 200 GeV. Even though the signal would be greatly reduced, the background would 
be even more so. In that region the jets from V + H — V + bb or from backgrounds such 
as V +G — V +b would appear to be a single fat jet. Relevant events could be identified, 
e.g., by applying the C/A algorithm with a large R ~ 1.2 to capture most of the radiation 
and tagging the Z or W by its leptonic decays. 

The next step is to determine whether the event is more likely due to a Higgs, by 
undoing (or unwinding) the jet recombination sequence one step at a time, starting from 
the last. Each such step involves the branching of a parent jet into two subjets. If there 
is a significant mass drop (i.e., virtuality drop) between the parent and the more massive 
subjet, the splitting is not too asymmetric, and the subjets are b-tagged, then one considers 
the event as a Higgs candidate with its decay into the two quark jets occurring at that 
branching. If the first two criteria are not satisfied, one discards the less massive subjet and 
considers the subsequent branching of the more massive one. If one reaches the end of the 
chain without finding a candidate branching, the event is considered to be background. 

The large R value virtually guarantees that the fat jet will be contaminated by radiation 


°For example, the probability of an ordinary quark or gluon radiating a soft gluon of energy E at a 
small angle 0 is ~ Zaa dE d2 where C = 4/3 (quark) or C = 3 gluon is the quadratic Casimir of the 


E ð 
emitter (e.g., Salam, 2010a). 
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from the underlying event, decreasing the resolution on the Higgs mass. Some of this is 
discarded by the unwinding process. However, the event can be further groomed (cleaned 
up) by an additional filtering step, in which the jet-finding algorithm is applied again, but 
this time with a much smaller R (e.g., min(0.3, Rẹ5/2), where R,z; is the distance between 
the quark subjets). Keeping only the three hardest jets (to allow for the radiation of a hard 
gluon from one of the quarks) and applying b tags to the two hardest results in a sample 
of fairly clean candidate events with a significantly enhanced signal to background (i.e., 
S/V B) ratio. 

Many other jet substructure, jet event, and jet grooming (e.g., “pruning” and “trim- 
ming”) techniques relevant to boosted objects, with applications to Higgs, electroweak 
boson, top, and BSM physics, are described in detail in (e.g., Abdesselam et al., 2011; 
Altheimer et al., 2012; Shelton, 2013). 


Vector Boson Fusion 


We have mainly been concerned with the hard scattering of quarks and gluons. However, 
the electroweak vector bosons y, W+, and Z can also be radiated from the incident p or p 
(or from the e® in e*e7~ or ep colliders), and in some cases can be thought of as additional 
partons. 

First consider e~e*. The dominant process at low energies is s-channel annihilation 
through a photon. However, the cross section falls as 1/s, as in (2.232) on page 46. There 
are also two-photon diagrams for e~ e+ — e~e+ F, where F can be (~ +, 7°, ntr, +++, as 
shown in Figure 6.7. We encountered such diagrams in considering deep inelastic scattering 
from a quasi-real (nearly on-shell) photon in Section 5.6, but here we are more interested 
in the case that both photons are quasi-real. Because of collinear singularities these cross 
sections grow in energy as a power of Ins and eventually dominate at high energy despite 
the extra power of œ in the amplitude. To a good approximation the processes can be 
described by the equivalent photon approximation, that is, by a integral of the cross section 
Oyy+F(k1, k2) for two real transversely-polarized photons of energies k;,2 to scatter into F, 
weighted by the probabilities for the beam particles to radiate them (cf. Equation 6.13). 
See (Brodsky et al., 1971; Nisius, 2000) and the Cross-Section article in (Patrignani, 2016) 
for detailed discussions. 

Similarly, one can consider the radiation of V = W= or Z from an incident quark (or 
lepton) at a hadron (or lepton) collider with energy ys > My, with V scattering from a 
constituent of the other beam particle. At high enough energies this can be approximated by 
the effective W (Z) approximation (Kane et al., 1984; Dawson, 1985; Chanowitz and Gail- 
lard, 1985), i.e., as a flux (“parton distribution function” ) of nearly on shell transverse (Vr) 
or longitudinal (Vz) vector bosons times a cross section. In particular, vector boson fusion 
(VBF) describes the process VV’ > F illustrated in Figure 6.7. The WW- or ZZ > H 
processes were important in the Higgs discovery, while W,W, —> W,Wy will be described 
in Section 8.5. The detailed kinematics implies that VBF will typically be accompanied by 
energetic forward jets associated with the beam remnants, while the color-singlet nature of 
the V means that there should be a rapidity gap, i.e., relatively little hadronic activity in 
the central region other than from the F' (e.g., Han, 2005). 


Collider Observables 


Many observables for testing the standard model and searching for new physics are possible 
at hadron and other colliders, some of which have already been touched on or will be 
mentioned in subsequent chapters. These include: 
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Figure 6.7 Left: two-photon diagram for eet —> e-et7®. Similar diagrams could 
produce other final states, such as £~€* or t+ a~. Right: vector boson fusion (VBF) 
diagram for pp + F+ forward jets, where V and V’ can be WF or Z. Similar 
diagrams are possible for e~e* at sufficiently high energy. 


Jet observables, such as single (inclusive) jet production as a function of pr and y or n, 
including boosted jets, jet shape, substructure, composition, and flavor tagging; dijet 
production as a function of the dijet mass and various angular separations between 
them; distributions in the numbers and event shapes (Banfi et al., 2010) of multiple 
jets; forward jets; rapidity gaps; and production in association with other particles. 


Production of single leptons (presumably in association with missing pyr), lepton 
pairs, multiple leptons from BSM particle decays, or collimated lepton jets from 
boosted decays (Arkani-Hamed and Weiner, 2008). Lepton pairs can be same sign 
(SS), opposite sign (OS), same flavor (SF), or opposite flavor (OF). For example, the 
Drell-Yan process qq — y, Z — €*€~ should produce OSSF dileptons. 


Preferential production of third family particles (t, b, T), especially in BSM theories 
associated with electroweak symmetry breaking. 


Resonances (bumps in the mass distributions of, e.g., dilepton or diquark pairs). These 
may be due to the production and decay of new BSM particles such as a new U(1)! 
gauge boson (Section 10.3.1.), and may be described by a Breit-Wigner distribution 
or in the narrow width approximation (Appendix F). 


Angular distributions, forward-backward asymmetries, and charge asymmetries, e.g., 
in dilepton or dijet distributions. 


Spin correlations, polarizations, and possible T-violating observables. 


Associated productions of BSM particles, due to new conserved quantities such as 
R-parity in supersymmetry (Section 10.2). 


Cascade decays, i.e., multistep decay processes of very heavy BSM particles, often 
associated with kinematic edges and missing energy, such as can occur in supersym- 
metry. 


Missing transverse energy, e.g., due to unobserved heavy stable or quasi-stable parti- 
cles. 
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Stable (on collider time scales) strongly interacting particles that either pass out of the 
detector or stop, possibly decaying much later. Examples include R-hadrons (Kraan 
et al., 2007; Kang et al., 2008), which are gluinos or heavy exotic quarks bound into 
hadrons with ordinary quarks, and stabilized by R-parity. 


Displaced vertices due the the decays of long-lived particles, such as b quarks or neutral 
particles in hidden valley models (Strassler and Zurek, 2007). 


These and other signatures, and their motivations from BSM physics, are reviewed 
in (Alves et al., 2012). 


6.3 PROBLEMS 


6.1 (a) Consider an idealized colliding beam experiment in which bunches of particles 
moving in the z direction collide head-on with frequency f. Show that the instantaneous 
luminosity defined on page 206 is given by 


N,N: 
isit / n(z,y)?de dy = PAN, 


where dı, are the lengths of the bunches, and n(x, y) is the number density of each bunch 
as a function of the transverse directions, which is assumed to be the same for each beam 
and independent of z within the bunch. Nj,2 are the total number of particles in each bunch, 
and the effective transverse area A is defined by the second form. 

(b) Suppose n(a, y) is a constant n over a transverse area A and zero outside. Calculate A. 
(c) Show that A = 470,0, for the more realistic Gaussian density profile 


n(a,y) x e Eiee aa, 


where Cg y are the RMS radii. 


6.2 There are a variety of Fortran, C++, and Mathematica packages available at 
www.hep.ucl.ac.uk/mmht/ or other websites listed in the bibliography for generating par- 
ton distribution functions obtained from global fits to the data at LO, NLO, and NNLO. 
Download one of these packages and use it to (qualitatively) reproduce the central values 
in Figure 5.15. The same package can be used for subsequent problems. 


6.3 Plot dLi;j/dê dy in pb for ij = tu, uti, uu, Gu, and GG as a function of y at the LHC 
(14 TeV) for v8 = 1 TeV. 


6.4 (a) Calculate the cross sections for pp > tt at \/s = 2 TeV (Tevatron) and for pp > tt 
at 8 and 14 TeV (LHC). Give the individual contributions of gg (summed over q = u, d, s, c) 
and GG. The relevant parton level cross sections were calculated in Problem 5.6, can be 
found in (Patrignani, 2016), or can be inferred from (2.232) for gg. Use LO PDFs and 
neglect the running of a, ~ 0.1. 

(b) Plot the contributions of gg, GG, and their sum to do(pp > tt)/dy (in units of pb) and 
to the t-quark pr distribution do(pp — tt)/dpr (in pb/GeV), both at 2 TeV. 


6.5 Suppose there exists a spin-0 resonance R with mass Mp = 1 TeV and width T'R < 
Mr, which couples to gluons, photons, wt, and dd, with partial (spin and color summed) 
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decay widths Ieg, Dis Pua, and Iag, respectively. Show that the cross section for pp > 
R — yy in the narrow width approximation (Appendix F) is of the form 


o = [Aga Baa + ua Bua + AgGBaal By; 


where By; = Ti; /Tp is the branching ratio into ij. Ignore any contribution of yy to the 
production. Give explicit expressions for the \;; in terms of the parton luminosities, Mp, 
and Tr, and calculate their numerical values in fb at LO for the (14 TeV) LHC, assuming 
Tr =10-*Mp. Be careful of the color and identical particle factors. 


6.6 Perhaps the best known event shape variable is thrust, 


T = max Dai (Pi PI 
ae Dulal 
where the sum is over all of the particles in the event, and the thrust axis 7, is chosen 
to maximize the sum. (At hadron colliders one can replace p; by pyr.) Calculate 7 for 
(a) two back-to-back particles (or highly-collimated jets); (b) many particles, distributed 
isotropically; (c) three particles in a plane with equal energies and separated by 120°. 


CHAPTER 7 


The Weak Interactions 


DOI: 10.1201/b22175-7 


In this chapter we discuss the charged current weak interactions. After a short overview 
of the developments that led to the standard model, we describe the Fermi theory, which 
in its modern form still gives an excellent description of a wide variety of weak decay 
and scattering processes at tree level. (In some cases, radiative corrections, which require 
the full structure of the standard model, are required.) Some representative processes are 
calculated and described. Much more detailed treatments may be found in, e.g., (Commins 
and Bucksbaum, 1983; Renton, 1990; Langacker, 1995; Patrignani, 2016). 


7.1 —_ORIGINS OF THE WEAK INTERACTIONS 


We first give a brief history of some of the highlights in the history of weak interactions 
and the development of the standard electroweak model (for a professional history, see Pais, 
1986). The story begins with the discovery of radioactivity by Henri Becquerel in 1896. 
One of the three types of radioactive decays that were identified was nuclear 8 decay, in 
which, apparently, (N, Z) > (N —1,Z + 1)e7~ (or in modern terms, subsequent to the 
discovery of the neutron, n — pe~). By 1914, experiments by Chadwick established that 
the e~ spectrum was continuous, suggesting that 6 decay violated the conservation of 
energy. As we now understand it, momentum and angular momentum would also have been 
violated. In 1930 Wolfgang Pauli speculated that the missing energy was carried off by a 
hypothetical weakly coupled neutral particle, dubbed the neutrino by Enrico Fermi. The 
neutrino (or anti-neutrino) would be difficult to detect because of its very weak interactions, 
but not impossible. Its existence was confirmed in 1956 when the electron antineutrino, De, 
was directly observed near a reactor by its rescattering to produce a positron via inverse 8 
decay, Pep — e*n (Cowan et al., 1956). The second neutrino type, associated with the muon, 
was discovered by a group headed by Lederman, Schwartz, and Steinberger at Brookhaven 
in 1962, who observed its rescattering to produce a muon (Danby et al., 1962). Following 
the observation of the t* in 1975 (Perl et al., 1975), the existence of the third, tau-type, 
neutrino, v;, was unambiguously inferred from the 7 lifetime and decay properties (see, 
e.g., Langacker, 1989b). However, it was not observed directly until 2000 when the DONUT 
collaboration at Fermilab observed its rescattering to produce r+ (Kodama et al., 2001). 

In 1934 Fermi proposed a theory of 8 decay, n + pe~ De, which loosely resembles QED 
but involves a zero range (non-renormalizable) four-fermion interaction and non-diagonal 
charged currents, as shown in Figure 7.1. The Fermi Hamiltonian density is 


H = Gp)", (7.1) 
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where 
J = PYN + Veype (7.2) 


is the charge-raising vector current, which describes the transitions n —> p and e~ > ve. 
The charge-lowering current is 
Jy = AYuP + EYpVe, (7.3) 


which describes p > n and ve — e7 (or the creation of an e~ De pair). The coefficient 
Gp ~ 1.17 x 107° GeV? ~ 1.02 x 1075m? (7.4) 


is the Fermi constant, which describes the strength of the interaction. It has dimensions 
of mass~?, characteristic of a non-renormalizable theory. In amplitudes it is typically mul- 
tiplied by A?, where A = O(MeV) is a characteristic energy release, so that indeed the 
interaction is very weak. 


e- 
P y e Ve ue 
n Ve e- Ve 
Figure 7.1 Left: four-fermion interaction leading to 8 decay. The vertices are at the 


same spacetime point but are displaced for clarity. Middle: four-fermion interaction 
for vee~ — vee. Right: a higher-order correction to Vee™ > Vee™. 


The original Fermi theory described 8 decay and related processes. It has been suc- 
cessfully modified over the years to incorporate new observations,! including parity vio- 
lation (Lee and Yang, 1956; Wu et al., 1957) and the V — A theory (Feynman and Gell- 
Mann, 1958; Sudarshan and Marshak, 1958); u and 7 decays; strangeness changing de- 
cays [Cabibbo mixing (Cabibbo, 1963)]; the quark model; heavy quarks and mixing [the 
Cabibbo-Kobayashi-Maskawa (CKM) matrix, which allows the incorporation of CP viola- 
tion (Kobayashi and Maskawa, 1973)]; and neutrino mass and mixing. In its modified form 
it still gives excellent tree level descriptions” of a wide variety of charged current mediated 
decays and scattering processes, including: 


Nuclear/neutron 8 and inverse 8 decay (n > pe"; Ven +e p; Ee p— Ven) 


H, T decays (u7 —> e7 Debu; Th Dye, Vrn, +++) 


Tm, K decays (rt > tvp, mete; Kt > pty, nPetve, nn’) 


hyperon decays (A > pr™; E` 4 nn7; Ut + Aetve) 


1See (Langacker, 1981; Commins and Bucksbaum, 1983; Pais, 1986) for a more complete list of references. 

2Some of the processes are now well enough measured that one needs to apply higher-order elec- 
troweak (e.g., Sirlin and Ferroglia, 2013) and QCD (Buras, 2011) corrections, which often require the 
full renormalizable standard model to make sense. 
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heavy quark decays (c > setve; b— cu, cm) 


v scattering (Vue > W ve; Vun >u p; VaN >u X) 
r — S 
“elastic” deep—inelastic 
However, the Fermi theory violates unitarity at high energy, reflecting its non- 
renormalizability. For example, the cross section for y%.e~ + eve grows with energy when 
computed at tree level. In the modern V — A version the middle diagram in Figure 7.1 yields 


G28 
one > eve) > =, s= Ebm. (7.5) 
T 
This is a purely S-wave process, so S-wave unitarity (Appendix C) requires o < 16r, 
Unitarity therefore appears to fail for 

E T 
CM > [= ~ 500 GeV. (7.6) 

2 Gr 


This is not by itself so serious. In non-relativistic potential scattering, for example, the 
Born approximation is not unitary, but unitarity is restored by higher-order terms. For 
the Fermi theory, however, higher-order contributions are divergent, again due to the non- 
renormalizability. For example, the diagram on the right in Figure 7.1 involves the integral 


OO] cm 


which diverges quadratically for k — oo (we have neglected the e~ mass and the external 
momenta, which is valid for large enough k). Although vee™ — e~1 has only been measured 
at low energies, there is clearly a theoretical inconsistency; the Fermi theory cannot be the 
full story. 

In the intermediate vector boson theory (IVB) the four-fermion interaction was elim- 
inated (Yukawa, 1935; Schwinger, 1957). Instead, it was assumed that the process was 
mediated by a spin-1 particle,? analogous to the photon in QED. However, the intermediate 
bosons W= were assumed to be very massive (compared to the energies of the experiments) 
and electrically charged, as indicated in Figure 7.2. The coupling to fermions is given by 


L = gW} set + gW7 J, 7.8 
H H 


where g is the coupling strength. For Mĝ& > Q? = —q?, where q is the momentum transfer, 
the denominator q? — M8, of the W propagator can be replaced by —Mj,, and one has 
effectively a four-fermion interaction. We will see that this reproduces the Fermi theory at 
tree level with the identification 


Gr o 
Va 8MR, 


(the factors of 2 and V2 will become apparent). However, the full propagator leads to a 
better behaved amplitude for vy.e~ —> e~v, at high energies. 

Unfortunately, the difficulties reemerge when we consider processes involving an external 
W. As was briefly discussed in Section 2.5.1, a vector boson with an elementary mass term 


for My >Q (7.9) 


3Yukawa actually suggested (in the same paper as the meson theory) that the proposed charged spin-0 
meson could also have a weak coupling to leptons, leading to 8 decay. 
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Figure 7.2 Top: intermediate vector bosons mediating 8 decay and vee™ —> e` i. 
Bottom left: diagram for et'e~ — WtW- in the IVB theory. Bottom right: addi- 
tional diagram in the SU(2) theory. 


leads to a non-renormalizable theory, and amplitudes involving the longitudinal degree of 
freedom are badly behaved at high energy due to the polarization vectors elk, 3) ~ ky /Mw. 
In particular, the amplitude for ete~ — W*W violates unitarity for v/s = 500 GeV due 
to the diagram on the lower left in Figure 7.2. 

It is possible to resolve these problems by adding more particles in such a way that the 
bad high energy behaviors cancel. For example, one may add a massive, electrically neutral 
W? boson, so that there is an additional diagram for ete~ — W+W~, as shown on the 
bottom right in Figure 7.2. Choosing the neutral current J° that describes the W°-fermion 
coupling and the triple-vector W°W*W~ coupling so that not only ete — WtW- but 
also related amplitudes like vee” — W°W~ are well behaved, one obtains [J, Jt] x J? and 
3- and 4- point vector vertices that are equivalent to an SU(2) gauge theory! Thus, one 
can regard the gauge invariance as a necessary consequence of well-behaved high energy 
amplitudes. 

The SU(2) model just described has no room for electromagnetism and is not realistic. 
It was extended by Sheldon Glashow in 1961 (Glashow, 1961) to an SU(2) x U(1) model, 
with the y and the prediction of a second neutral boson (the Z) as well as the W=. The 
gauge sector of the Glashow model is in fact the standard model, but at the time there was 
no satisfactory mechanism for generating masses for the WF and Z. These had to be put 
in by hand, resulting in difficulties for the behavior of cross sections such as WtW- > 
WtW7 at high energy. This was remedied by Steven Weinberg in 1967 (Weinberg, 1967a) 
and independently by Abdus Salam (Salam, 1968), who invoked the Higgs mechanism to 
generate their masses and those of the chiral fermions by spontaneous symmetry breaking. 
Weinberg speculated that the SSB would preserve the renormalizability of the theory. This 
was proved by ’t Hooft and Veltman and others in 1971 (’t Hooft, 1971a,b; ’t Hooft and 
Veltman, 1972; Lee and Zinn-Justin, 1972). 
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The original Weinberg model considered leptons only, in part because the quark model 
was not well-established. However, quarks would not have led to a satisfactory extension 
to hadrons, because only the u,d, and s quarks were known at the time. One could de- 
scribe the dominant d + u transitions by assuming they transformed as an SU(2) doublet. 
The s would have to be an SU(2) singlet because it had no charge-2/3 partner. However, 
somewhat weaker s © u charged current transitions were observed experimentally, so there 
would have to be d — s mixing. That in turn led to strangeness changing neutral currents, 
i.e., d— s transitions mediated by the Z, which had not been observed experimentally. In 
particular, both the tree-level and loop diagrams in Figure 7.3 would lead to K? 4 K? mix- 
ing much larger than what was observed. This difficulty could be remedied if there existed 
a fourth, charge-2/3 charm (c) quark. The c and s could transform as an SU(2) doublet. 
With the d and s transforming the same way, the off-diagonal Z vertex disappeared, and 
the box diagram was strongly suppressed by the cancellation of the largest parts of the c 
and u exchange amplitudes [the GIM mechanism, (Glashow et al., 1970)]. An early esti- 
mate (Gaillard and Lee, 1974b) of the necessary mass, Mme ~ 1.5 GeV, was very close to the 
actual value (see (5.124) on page 194). 


s RK d s R’ d 
W 
Z u u 
W 
d K `s d K” a 


Figure 7.3 Left: K? 4+ K? mixing induced by strangeness changing neutral current 
vertices in the 3 flavor SU(2) x U(1) model. Right: box diagram contribution. 


However, physicists at the time were reluctant to entertain the possibility of additional 
quarks (after all, the quark model had been invented to simplify the hadron spectrum). 
In 1974, however, the J/wW resonance was discovered simultaneously at Brookhaven and 
SLAC, and it was soon tentatively identified as a cé bound state. The existence of charm 
was subsequently confirmed by the identification of singly-charmed mesons and in neutrino 
scattering (see, e.g., Gaillard et al., 1975; Rosner, 1999). 

The (flavor diagonal) weak neutral current processes mediated by the Z were discovered 
at CERN and Fermilab in 1973, and extensively probed experimentally in the 1970s and 80s. 
When combined with the observation of charm, the basic ingredients of the standard model 
were in place. QCD was established during the 1970s as well. The W and Z were discovered 
with the expected masses at CERN in 1983, and high precision tests of the Z interactions 
at the level of loop corrections were carried out at the LEP and SLC colliders at CERN 
and SLAC, respectively, from 1989-2000. The loop corrections allowed the prediction of the 
top quark mass, which was confirmed by its direct discovery at Fermilab in 1995. Since 
~ 1995 precise tests of the unitarity of the CKM matrix and verification that it describes 
the observed CP violation were carried out in B physics experiments, especially at Cornell, 
SLAC, KEK, and later at the Tevatron and the LHC. Finally, hints of nonzero neutrino 
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mass from solar and atmospheric neutrino experiments were confirmed in 1998, and the 
neutrino sector intensively studied since that time. The Higgs boson, the final ingredient of 
the standard model (extended to include neutrino mass), was discovered at CERN in 2012. 


7.2 THE FERMI THEORY OF CHARGED CURRENT WEAK INTERACTIONS 


Let us now describe the Fermi theory of charged current interactions in more detail, mainly 
in the V — A form that was developed prior to the standard model (the extension to 
include charm and the third family is straightforward). More detailed discussions of the 
experiments in 8, y, n, K, and hyperon decays that led to its development, including analyses 
in a more general framework allowing general S, P,T,V, and A interactions, may be found 
in (Commins and Bucksbaum, 1983; Renton, 1990; Langacker, 1995; Severijns et al., 2006). 
The Hamiltonian density is 


Gp 
H = Ji ye, 7.10 
p” (7.10) 
where the Fermi constant Gp is given in (7.4). The v2 factor compared to (7.1) is due to 
the modification from V to V — A currents. The charge raising current is given by the sum 
of leptonic and hadronic currents, 


JI =a ed (7.11) 
The leptonic current is 
Jt = Dery (1-7) € + Puy (1—7) w= 2 (DeL Yer + Purple), (7.12) 


where of course Yr = Pry = =F The V, — A,, (i-e., y(1 — 7°)) form corresponds to 
the maximal amount of parity and charge conjugation violation. One can add 2v7 YuTL to 
(7.12). The leptonic charge lowering current is 


Ji =e (1 — 7) Ve + Pp (1 — 7°) My = 2 (€n%uMen + BLYuYuL) - (7.13) 
The simple extension of (7.2) and (7.13) for the hadronic current would be 
TM ~ Pyp (1— 75) ncos Be, Ih ~ nyu (1 — 7°) pcos Be, (7.14) 


where the cos@, Cabibbo angle factor will be discussed below. This would describe 8 decay. 
However, there are other observed hadronic transitions, including hyperon decays such as 
E- — n, and meson decays involving t+ > 7°, K+ > 7°, or (K+, nt) > vacuum. These 
could be described by adding additional terms to (7.14). However, a much simpler and 
more universal form is obtained by writing it in terms of quark fields. The form prior to the 
discovery of the c,b, and t quarks was 


an = Up (1 — 7) d = 21 yd, J} = dyn (1 — 7) u= 2d Y UL, (7.15) 
where a sum over color is implied and d’ is a rotation of the d and s quark fields 
d' = dcos@, + ssin be. (7.16) 


The angle ĝe is the Cabibbo angle, with value sin ĝe ~ 0.23. It describes the mismatch 
between the weak eigenstate d', i.e., the field that couples to the u quark in i, and the 


The Weak Interactions 229 


mass eigenstates d and s. tan ĝe measures the ratio between strangeness changing, AS = 1, 
and strangeness conserving, AS = 0, transition amplitudes, such as ©” — ne» and 
n — pe De, respectively. The fact that the hadronic and leptonic currents have the same 
strength, up to the rotation, reflects Cabibbo universality. That is, the squared amplitudes 
for y > Vue Ve, n > pe Ve, and U~ —> ne~ De are all different, but they are universal in 
the sense that if one could ignore strong interaction and mass effects one would have 


|M(u- > vpe De) |? = |M(n > pe z)|? + |M (© > nev) |”, (7.17) 


since they scale as 1 : cos? ĝe : sin? ĝe ~ 1: 0.95: 0.05, in agreement with the observed 
strengths. This result emerges naturally in the quark theory, where it is seen to result from 
the simple rotation in (7.16), but is more mysterious otherwise. Universality generalizes 
easily to the 4 and 6 quark cases (CKM universality), and in the standard model is seen to 
be the result of a universal gauge interaction combined with family mixing. 

Equation (7.15) incorporates the empirical AS = AQ rule for strangeness changing 
transitions. For example, in the observed X~ — n transition both the strangeness and the 
hadronic electric charge increase by one unit. However, the AS = —AQ transition Ut > 
ne*v, is not observed in nature. Similarly, despite many experimental searches strangeness 
changing neutral currents (or flavor changing neutral currents (FCNC), as they are now 
called) have never been observed, with the exception of processes that are consistent with 
being of higher order in the electroweak interactions. Thus, there are no s > d transitions, 
which could lead, e.g., to X° — nete~. This is illustrated by the weight diagrams in 
Figure 7.4. 


Figure 7.4 Weight diagrams for the baryon octet and for the quarks. The hadronic 
charge raising current J’! mediates transitions to the right (AS = 0) or diagonally 
up and to the right (AS = AQ), but not diagonally up and left (FCNC) or to 
non-adjacent states (AS = —AQ). 


Using (7.10), (7.12), and (7.15), the four-fermion Hamiltonian is H = f d°zH, where 


H= E (I + JH) (JH + J) = Hi + st + Hn. (7.18) 


In (7.18) the leptonic Hamiltonian density is 


(= Dm (1— 7”) 3 (= Eng” (1-4) a] (7.19) 


Gr 
H= E 
= A2 
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where em = e or pt. It is responsible for such purely leptonic decays and processes as muon 
decay (42 — v,e~ De) and inverse muon decay (v,e~ — pv Ve). The semi-leptonic density 
is 


(= Pm (L— 7") on] (ar dan) u) 
(rtv) (geert) 


It is responsible for decays of hadrons into final states that include leptons, such as n > 
pe“ De, U > ne De, Kt > pty, and Kt + etve, as well as for neutrino scattering 
and inverse 8 decay, ven < ep. In the extension to three lepton families it also allows 
semi-hadronic 7 decays, such as T~ > v n~. The non-leptonic term 


Hu = E (an - P) ) (y" (0-79) 0) (7.21) 


GF 
Hası = 
V2 
(7.20) 


drives decays not involving leptons, e.g., Kt > ntr?, K+ > ntron’, Et > pr®, and 
A? — nT? (Commins and Bucksbaum, 1983). It can also generate a parity-violating pertur- 
bation in the NN interaction (Ramsey-Musolf and Page, 2006; Haxton and Holstein, 2013; 
de Vries et al., 2014), leading to effects in polarized pp scattering, nuclear transitions, and 
an electromagnetic anapole (77°) moment (Haxton and Wieman, 2001). It is difficult to 
calculate matrix elements of Hn, reliably because two hadronic currents are involved.* 

There is an obvious asymmetry between the quarks and leptons in the 3 quark case. That 
was remedied subsequently with the discovery of the charm quark, which could partner with 
the s as an additional term in the hadronic current 


Al = Un (1 7) d+ Op (1 a7) 8 = Wp Yud, + 2er WSL, (7.22) 


where 
s’ = s cos ĝe — dsin be. (7.23) 


One can also extend to a third family. Then, 


e d 
J = (DePu Pr) Yu (1 — a) u | + (ety — Voku S Jo (7.24) 
T b 


where Voxy is the 3 x 3 unitary CKM quark mixing matrix. Empirically, the terms in 
the CKM matrix that mix the third family with the first two are small and can usually be 
ignored when considering c, s, or d decays. (They are of course critical for b decays.) These 
extensions will be mentioned in Section 7.2.7 and (along with a further extension to include 
neutrino mass) described in the context of the full standard model in Chapters 8 and 9. 


“For example, the AS = +1 part can be decomposed into operators with total isospin 1/2 and 3/2. 
Empirically, the kaon and hyperon decay rates involving the isospin AJ = 1/2 piece are much larger than 
those driven by the AI = 3/2 term, i.e., the AJ = 3 rule, or the octet rule from the SU(3) perspective (see, 
e.g., Commins and Bucksbaum, 1983; Donoghue et al., 2014). Especially puzzling are the K — 2m decays, 
where the ratio of the relevant amplitudes is around 20. Short-distance QCD corrections can account for a 
factor of ~ 3 (Gaillard and Lee, 1974a; Altarelli and Maiani, 1974), but the rest must be due to long distance 
non-perturbative effects. There has been some recent progress understanding this from analytic 1/Ne (Buras 
et al., 2014) and lattice calculations (Boyle et al., 2013), and from the AdS/CFT correspondence (Maldacena, 
1998; Hambye et al., 2007). 
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The charged current Fermi Hamiltonian is of the V — A type, i.e., each term is of the 
form 


where 


Vad (t, T) = Walt, T )ypo(t, =), Anab (t, T) = Walt, Z) wv (t, T) (7.26) 


are, respectively, vector and axial currents. We saw in Section 2.10 that under space reflec- 
tion 


Vaablt, Z) + PVyar(t,Z)P~* = Vh (t, —7) (727) 
Anablt, Z) > PAnablt, Z)PT' = —A}, (t, —2). l 
Changing the sign of the dummy integration variable and using A“ B, = A,B", 
H > PHP! 
Gr (7.28) 


= ea  (Yaaal Z) + Ayal(t, ay (väle, Z) + AH, (t, z)) ze re] l 
i.e., the interaction changes to V + A and the VA interference terms change sign. Parity 
is violated maximally, as is manifested by the fact that left-chiral fields (corresponding to 
left-handed particles and right-handed antiparticles in the massless limit) participate in 
weak transitions but right-chiral fields do not. It was not initially suspected that parity 
was not conserved, and in fact most physicists took its conservation for granted.” However, 
in the mid 1950s, the decays K+ — mtr? and K+ — r+n~7° were both observed. Since 
the K* has spin-0 and the pion has negative intrinsic parity, the first mode would require 
an even intrinsic parity ņng+ = +1 if parity were conserved, while the second would imply 
ng+ = —1. This led Lee and Yang to reexamine the question of whether parity was 
conserved in the weak interactions (Lee and Yang, 1956), and soon thereafter C. S. Wu et 
al. established parity violation by observing an asymmetry in the direction of the emitted 
electron w.r.t. the nuclear spin direction in the 8 decay of polarized ®°Co (Wu et al., 1957). 
Similarly, under charge conjugation 


Viuiab > Vial = —Vuba = -Vi o Anab —> CAparC* = +Anba = +Al a (7.29) 
Thus a i i 
H => CHC™! = [ees (ce + Awiz) (Vs + Abs) F he] . (7.30) 


Charge conjugation therefore changes V — A to V +A, just like space reflection. This implies 
that only left-handed particles and right-handed antiparticles are involved in weak charged 


5Except for Dirac, who “did not believe in it” (Pais, 1986, p.25). 

6At first, it was thought that the new modes represented the decays of two distinct particles, 9 > 
ntn’ and T + ntr r?, but experiments indicated that they must have the same mass, spin (= 0), and 
lifetime, but opposite parity. This unexpected state of affairs was known as the 7 — 0 puzzle (Commins and 
Bucksbaum, 1983). 
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current transitions. Even though C and P are each violated maximally, the product CP 
restores H to the V — A form and CP is conserved, 


(CP)H(CP)* =H. (7.31) 


For example, the charged current weak interactions of the e; are the same as those of the ek, 
while eg and ef do not enter the charged current. Actually, there is observed to be a small 
amount of CP violation in nature, much weaker than the normal weak interactions. This 
cannot be accommodated in the Fermi theory, or even its extension to 4 quarks. However, 
as we will describe in Section 8.6, it can occur in the three family theory. 


7.2.1 js Decay 


The muon (u*) is a heavy version of the e*, with mass m, ~ 105.7 MeV and lifetime 
Tą = 2.20 x 107° s. The u` decays nearly 100% of the time into e7 v„Pe via the weak 
charged current, including a small electromagnetic radiative correction leading to e~ Vey. 
(The properties of the z+ are analogous by CP invariance.) The diagram for u~ > e~ Vy, 
is shown in Figure 7.5. The Lagrangian density’ 


e` (pa) 
Vu(p3) l | De(p2) Le (p2) Palpi) 
: f 


vA y y 


A 
---->--- 


m (q) 
Figure 7.5 Left: Feynman diagram for p~ — e~ V„De. Right: diagram for 7” > ~D,- 


G 
-L=H= A Pup (1 — 7) u ey" (1 — 7°) Ve (7.32) 


corresponds to a matrix element 


iGF _ Bin. 5 
M =—-—=t 1— uy tay" (1 — va. 7.33 
Ja ul qg”) u tay” (1—77) ve (7.33) 
The differential decay rate is 
d= (27)* 54 (p2 + p3 + pa — pı) d? pod’ p3d°p4|M |? (7.34) 
2M, (2r)? 2E (27)? 2E; (27)? 2E4 l 


It is sometimes convenient to use the Fierz identity in (2.215) on page 43 to rewrite this in the charge 
retention form —L = Sr an (1 — y5) L Day" (1 = 7°) Ve. 
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|M|? can be calculated using the standard trace techniques of Chapter 2. Neglecting m,, 
G2 1+7° ¢ 
|M}? s D [rs (1 = 7°) (pi + my) (= Ww (1 a 7”) Bs 


d k (1-37) Boy” (1—7) (Ba + me) (E) N 


where we have kept the u~ and e~ spin projections because many experiments have mea- 
sured them. For now, however, let us just calculate the total decay rate, obtained by aver- 
aging (summing) over the u~ (e7) spins (for m, = 0 only the left (right)-handed v, (De) is 
produced), 


= 1 
|M|? > |M]? = > > MP. (7.36) 


SuSe 


Neglecting the electron mass as well (m./m, ~ 1/200), 


|M]? =, lr [Yu (1 J 7°) (pi + My) YW (1 T 7°) ps 
x Tr [y* (1-35) poy’ (1-35) wal, 
where the m, term doesn’t contribute because it involves the trace of an odd number of 


y matrices. Despite the formidable appearance, (7.37) is straightforward to evaluate using 
the identities in (2.172) on page 36. One finds 


|M? =GETr [Yu bii B3(1+7°)] Te h" poy” pa (1 +7°)] 
=16G} [PipPav + PivP3u — GurP1 ` P3 + teprvw PPS | 
x [pops + pant — ge - pa + ie”? poppao] 
=16G% [2p1 - paps ` pa + 2p1 ` papa ` P3 — Epuru”? PI P3 PapP4c] » 


(7.37) 


(7.38) 


where the contraction of the two tensors has been simplified because the first three terms 
in each are symmetric in u and vy, and the last is antisymmetric. But 


Euvrwt” P? = —2 (gh gl, — gh.97) (7.39) 


from Table 1.2, so the last term is just 


2 pi: P2P3 ` P4 — 2 P1 ` Pape ` P3, (7.40) 


and therefore 
|M|? = 64G%p1 ` P2 p3 ` pa. (7.41) 
Incidentally, this calculation justifies the identities in (2.175) on page 37, noting that the 
sign of the last term in (7.38) is reversed for the 1 — Ay° case. 
Assuming that the neutrinos are not detected, we can integrate over their momenta to 
obtain the unpolarized differential decay rate 


4G? @ papi py fd? pod? p3 


ar = 
(27) Esm, E2 Es 


ôt (po + D3 — q) P2pP 30: (7.42) 


where q = pı — p4. To evaluate the integral, we note that 
_ f ËE 


Ti . = z 7.43 
p E, E 0° (p2 + p3 — q) P2pP3 ( ) 
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is a second-rank tensor, which can only depend on q. It must be of the form 
Ipo = Agpo + Bade; (7.44) 


since gps and qpqo are the only tensors available. We have extracted a q? in the first term so 
that A is dimensionless. A and B can be obtained by evaluating the much simpler integrals 
(Problem 7.1) 


g” Ipo = 4A? + BP =70, P Ip = (A + B) ¢ = T (7.45) 
Therefore, 
A= L B= us => Ino= L La" ie + 20p90] ; (7.46) 
6 3 6 
and 
~ mG? dp. 
d? = E {S [ir pa)” pi + pa + 2pr > (pr — pa) pa > (Pı -= pa)| . (7.47) 
3(27)° Eam, 
In the muon rest frame, neglecting the other masses, 
Pı = (m0) , Pı ` P4 5 M, Es, p? = Ma ri = 0, (7.48) 
yielding 
- 2G? dD, 
dr = TEA Esm, [ m, 2m, E4) Mp E4 + 2M, (mp a E4) my, Ea] 
on? (7.49) 
T 
= 3r) 2rd cos 0dE, [3m] — 4m, E4] Ej, 


My 


where @ is the polar angle of the e~ and 0 < E4 < -* is its energy. (E4 = 0 corresponds to 
two back-to-back neutrinos carrying all of the energy, while E4 = m,,/2 corresponds to the 
two neutrinos being in the same direction, so that po - p3 = 0). Integrating over cos@ and 
defining the dimensionless e~ energy € = 2E4/m,,, we obtain the final result for the muon 
lifetime, 


~_ Gam fr! Gam? 
-1 __p_ F "pH 2 E 
p= O J de 2 €* [3 — 2e] = TEL (7.50) 


which can be used to obtain the Fermi constant Gr = 1.17 x 107° GeV~? from the observed 
lifetime T,. 

The muon lifetime is known extremely well, 7, = 2.1969803(22) x 107° s, from the 
MuLan experiment at PSI (Tishchenko et al., 2013), and Gr is a critical parameter for the 
precision electroweak tests (i.e., for the prediction of the W and Z masses in the standard 
model), and also for tests of CKM universality. To extract a precise value one must include 
electron mass and two-loop electromagnetic corrections® (some one-loop diagrams are shown 
in Figure 7.6) to the lifetime formula, yielding (Patrignani, 2016) 


Gm, /m? 25 â(m?) â? (mi) 
=À Few e 2 H H 
z F 14 C , 7.51 
Tu 19273 (=) x ( 4 7 ) Qqr 7 T? ( ) 
r_a yN 
Fermi,with me#0 radiative corrections 


8The QED radiative corrections to Fermi theory are finite to all orders in a and leading order in Gp (for 
which there are no In My effects), provided the usual QED renormalizations are applied (Kinoshita and 
Sirlin, 1959; Berman and Sirlin, 1962; Sirlin and Ferroglia, 2013). On the other hand, the QED corrections 
to 8 decay are logarithmically divergent (they become finite in the full SM). The difference, which has 
nothing to do with the strong interactions or the neutron mass, is explored in Problem 7.4. 
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where 


F(x) = 1 — 82+ 82° — x4 — 1227 ln x 


is = = 2 m 

Alm)! = a> 1n (BE) + O(a) = 135.901 (7.52) 
_ 156815 518 , 895... 67 4 535 

G= a ar” — a6 O) + 7997 a Me). 


â(m?) is the running QED coupling in the MS scheme and ¢(3) ~ 1.202 is the Riemann Zeta 
function. We have omittted small mixed Mme — @ and hadronic terms, and a W-propagator 
correction in the extension to the SM is incorporated in the relation of Gr to My,z. Using 
(7.51), one obtains Gp = 1.1663787(6) x 1078 GeV~?. 


Figure 7.6 Representative low-order diagrams contributing to u decay in the Fermi 
theory, including one-loop and initial-state radiation diagrams. The cross represents 
a mass renormalization counterterm. 


The expression in (7.50) can be generalized to allow for a polarized muon and measure- 
ment of the electron polarization: 


GEM, o o 1 — 2 1— pe: ŝe | dedcosé 
T= TE [2e° (3 — 2e)] h ' G=) cos6] | ; | 5% (7.53) 


where cos 0 = Pe: ŝ, is the cosine of the angle between the electron momentum and the muon 
spin direction, and pe- ŝe/2 is the electron helicity. These formulae can of course be extended 
to include higher-order corrections, the electron mass, and the W propagator (Commins and 
Bucksbaum, 1983). The 2e? (3 — 2e) factor yields an e~ energy spectrum characteristic of 
V—A (see Section 7.2.7 and Problem 7.2). The pe’, and p--8- asymmetries reflect the parity 
nonconservation. The pe Se term implies that the e~ helicity is -4 in the limit me = 0, 
another characteristic of V — A. The cos@ term describes a correlation between the u7 
polarization direction and the electron momentum for a given e~ energy (the coefficient of 
cos 0 reverses sign for u* decay). This can be used to determine the polarization direction of 
a sample of muons, and was used, for example, to determine the muon spin precession in the 
Brookhaven experiment measuring the muon anomalous magnetic moment (Section 2.12.3). 
Similarly, the polarization of T’s produced in ete — rtr" in the Z-pole experiments at 
LEP was determined from the angular distributions of the decay products in the analogous 
leptonic T decays, such as T7 —> W~ vru- 

One can also carry out an analysis allowing a more general four-fermion interaction 
including S, P, and T, as well as V and A interaction terms. There are various parametriza- 
tions (see the reviews by Fetscher and Gerber in Langacker, 1995; Patrignani, 2016), such 
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as 


4Gp g E i E 
H = A (IYL EL Yo VeL PuL YP HL + IRRERYp VeRPuRY HR 


+ GL REL Yp VeL PuRYP HR + JGL ER Yp VeR PuL Y? HL 
Los g - S > = (7.54) 
9LLeLVeRYpRUL + GRRERVeL YL HR 


3 s = S = = 
+FJLRELVeRYVuLHR + JRLERVeLVuRHL 


+ GEREL tpo VeR DuL t7 UR + Jhi ĒR tpo VeL Pur t” uL) + h.c., 


where t?? = o”? /4/2. Equation (7.54) is the most general form assuming lepton-number 
and lepton-family conservation, but is actually applicable in the more general case in which 
arbitrary neutrinos and antineutrinos are emitted, if the neutrinos are not observed and their 
masses are negligible (Langacker and London, 1989). The Fermi theory predicts gf, = 1 
and others = 0. However, the other g”’s could be generated in left-right symmetric theories 
involving a second Wr that couples to V + A, i.e., by Wr exchange or W — Wp mixing, 
or by mixing of the known leptons with heavy exotic fermions. The g* could be generated 
from the exchange of a spin-0 particle, such as a non-standard Higgs boson or by R-parity 
violating couplings in supersymmetry. g7? could be associated with the exchange of a spin-2 
particle. The coefficients must be relatively real if T holds. A more detailed discussion of 
new physics contributions to u decay is given in (P. Herczeg, in Langacker, 1995; Kuno and 
Okada, 2001). 

There have been many precise experiments on muon decay and inverse muon decay, 
Ve~ —> W` Vve, including measurements of the electron spectrum and helicity, and of cor- 
relations involving the muon spin, at PSI, TRIUMF, and elsewhere. The data are suffi- 
cient to establish |g¥,| > 0.960 with (usually stringent) upper limits on the other cou- 
plings (Gagliardi et al., 2005; Hillairet et al., 2012; Patrignani, 2016). This excludes the 
possibility of alternatives to V — A as the dominant contributor to muon decay and inverse 
decay. One can also search for small deviations from V — A, e.g., due to small admixtures 
of effects involving the types of new physics mentioned above. It is convenient to generalize 
(7.53) to a general four-fermi interaction, 

ae = Cem D 
ae 19273 16 


e (2a —e)+ “(8 — 6) 


(7.55) 
F P E cos ka — e€) + TAS _ ol) an 


where additional terms involving me, the ef helicity, and radiative corrections are not dis- 
played. The Michel spectral parameters (Michel, 1950; Kinoshita and Sirlin, 1957) p, €, ô, 
and 7 (which appears in the me/m, corrections) and the overall normalization D are func- 


tions of oo while P, is the u polarization from m? —> pt DR The experimental values 


in Table 7.1 are in impressive agreements with the predictions of the V — A theory, 


p=b=-~ €=1, n=0, D=16, P,=1. (7.56) 


Very precise measurements of p, 6, and P,,€ from the TWIST collaboration at TRI- 
UMF (Hillairet et al., 2012), incorporated in Table 7.1, can also be used to set limits (Bayes 
et al., 2011) on a possible Wz mass and on a W — Wp mixing angle |¢,r| in an extension 
of the SM to SU(2); x SU(2)r x U(1) (Section 10.3). Other tests of the V — A theory of 
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TABLE 7.1 Experimental values (Patrignani, 2016) of the Michel parameters and their 
expectations in the V — A or standard models.“ 


Parameter Experimental Value V-A 
p 0.74979 + 0.00026 3 
n 0.057 + 0.034 0 
6 0.75047 + 0.00034 3 
Pu & 1.0009+9:9016 1 
P, &8/p 1.0018+9:9016 1 
Po 1.00 + 0.04 1 


“P+ is the e+ longitudinal polarization (twice the helicity). 


charged current leptonic interactions are provided by leptonic 7 decays, T > ¢vv, and by 
inverse muon decay. 


7.2.2 Vee” —> Vee 


In the Fermi theory the processes y.e~ + vee™ and Dee™ — ve proceed via the leptonic 


weak charged current interaction in (7.19), by the diagrams in Figure 7.7. The Lagrangian 
density is 


=LĻL=H= Sanki =)e ey" (1—77) ve 
= T Deya (1—75) ve eg” (1—7) e (7.57) 


> E Deu (1 — 7°) ve E7” (gv — ga”) e. 


The second (charge retention) form is obtained using the Fierz identity in (2.215) on page 43. 
The last form looks ahead to the full standard model, in which there is an additional neutral 
current contribution. The Fermi theory predicts gy = ga = 1. The spin-averaged squared 


e` (ps) Velpa) e (ps)  De(pa) 
Ko f NA 
se yk J M 
Ve(pı) e` (p2) De(p1) e` (p2) 


Figure 7.7 Diagrams for vee™ + vee™ and Dee™ — Ye” in the Fermi theory. There 
are additional weak neutral current contributions in the full standard model. 
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amplitude can be calculated in the same way as for u decay. One can use (2.175) to obtain 


1 
2 XO |M}? = 16G%[(gv + ga)’p1 -p2 ps pa + (gv — ga)” pi: P3 D2 pa 


81828384 
— (gy — ga)mé pı ` pa]. (7.58) 
(The incident ve’s all have helicity —1/2, so only the e~ spin is averaged. It is convenient 


to include sums over the ve spins, even though the contributions of the h = +1/2 states 
vanish.) Let us define the invariant 


. (s—m2)(1+cos 613) 
y= Pi Pa _ a a CM (7.59) 
Pı ` P2 =, lab 


where s = (pı + pz)? = m?2+2m-E,, E, is the incident v energy in the electron rest frame 
(i.e., the lab frame), and Te is the kinetic energy of the final electron in the lab. y, the 
fractional energy transfer to the final electron in the lab, runs from 0 to (1 + m,-/2E,)7}. 
Then, 


dove Ghm.E, Me 
ee = SEE Nay +4)? + (ov = ga)? C = W)? ~ 0b ~ 98) Fea 


1 do E GpmE, | 


ve — d 


(gv +ga} + (gv = ga] ; (7.60) 


where we have dropped the last (m-y/E,) term in the total cross section for simplicity. 
(The limits on y and the (invariant) cross section formula are most easily derived in the 
CM and then expressed in lab variables.) Similarly, for Dee™ > Dee”, 


doze GEmeE, 2 2 2 2 2\ Me 
= t= 
gee CEE flov + ga)?(I = y)? + (ov = gA)? = (È = A FEY 
Gm. E, [1 
Ope = A [50v + ga)” +(gv — ga)” ; (7.61) 


ve scattering was not actually observed until after the standard model was developed, and 
since then most of the experimental studies have involved vy, or 7, beams, since they are 
produced much more easily from m and K decays at accelerators. As will be described in 
Chapter 8, the different y distributions for the analogous v,,e (De) neutral current processes 
in the standard model were useful for determining the parity structure of the neutral current. 


7.2.3 mand K Decays 


The Te Decays: 7 — uv, ev 


The pions, 7+ = ud (du) and 7° = (uu — dd)/,/2, are much lighter than the other hadrons, 
with m,+ ~ 139.6 MeV, which is somewhat larger than m,o ~ 135.0 MeV due to electro- 
magnetic corrections. As described in Section 5.8, the small value of m, can be understood 
because of their role as pseudo-Goldstone bosons of an approximate global SU(2), x SU(2)r 
flavor symmetry of the strong interactions. The 7+ decay via the weak charged current, 


mainly into p+ D), denoted 7,2, with a relatively long lifetime of T,+ ~ 2.6 x 1078 s. The 7° 
decay is electromagnetic and mainly into 2y, with a much shorter lifetime 7,0 ~ 8.4x 1071" s. 
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The role of the chiral anomaly and the number of colors in the 7° lifetime was briefly de- 
scribed in Section 5.2. The branching ratios for the principal decay modes of the pions and 
charged kaons are listed in Table 7.2. Neutral K decays will be considered in Section 8.6. 


TABLE 7.2 Branching ratios for the principal decay modes of the pions and charged kaons.* 


mt > pty, (Ty2) 99.99% 
> eTe (Te2) 1.23 x 1074 Universality and V, A tests 
> etven? (Te3) 1.036 x 1078 m beta decay 
> pty 2x 1074 
Ww — 2y 98.8% Electromagnetic 
> ete-y 1.2% 
sete ete” 3x 107° 
Kt > pry, K2) 63.6% fe/ fr 
> eT Ve Kez) 1.6 x 107° 
=> pty,7 Ky) 3.4% Universality test 
> etven? Ke3) 5.1% Universality test 
>rt? Kor) 20.7% Nonleptonic 
> ntrtr K3r) 5.6% 
> n trln? K3r) 1.8% 


"The branching ratios of the conjugate 7~ and K- decays are the same by CP invariance. 


The Feynman diagram for the semi-leptonic decay process 7~ — pV, is shown in 
Figure 7.5. The matrix element is 


M= -itk ayy (1—75) v1 (017™* (0)|n-(q)), (7.62) 


where pı = pp,,, P2 = Pus q = Pr- = Pi + p2, and the hadronic current is 
Tht = VM — AM = ay, (1 — 7°) [dcos 6. + ssin 9] . (7.63) 


The first (cos@.) term is relevant to m decay. We saw in Section 2.10 that the pion is 
a pseudoscalar, P|r'(p’)) = —|m'(—p’)). Reflection invariance? then implies that only the 
axial current contributes to (0|J’'|m~(q)). To see this, recall that PV“'P~! = VÝ for 
the vector current, and that the vacuum is invariant, P|0) = 0. By Lorentz invariance, the 
matrix element must be of the form 


(OVi |x (q)) = iag", (7.64) 


where a is a constant, since q is the only four-vector available. (The current is evaluated at 
(t,£) = 0.) Therefore, by an argument similar to (2.275) on page 53, 


iag! = (0|P~* PV"! P" P| (q)) = —(0|V} |r (q')) = —iaq,, (7.65) 


where q = (Eq, 7) and q' = (Ey, =), i.e., qu = q“. Thus, a = 0. 


©The weak interaction parity violation is explicit in the V — A form of the current. The matrix elements of 
each part of the current are (reflection invariant) strong interaction quantities, up to negligible higher-order 
weak effects. 
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However, PAŁ P-t = —A”rÌ, so the axial matrix element can be non-vanishing. By 
Lorentz and translation invariance, 


(0| APT (x) | (q)) = i cos be frg e~t”, (7.66) 


where the cos 6, is extracted for convenience and f, is the pion decay constant introduced in 
(5.106) on page 191. (We saw in Section 5.8 that fr is related to the spontaneous breaking 
of SU (2)z x SU(2)r.) Thus, 


—frMm,G'r cos be z 
V2 


where we have neglected m,. Summing over the final spins 


z Pm? 
i = S IM? = EGR cos? Ace (A — v5) hi (1+5) (Po + mu), (7.68) 
PTT ——— 


8p1-p2=4(m2 —m2) 


so that 


2 
2 2 2 2 
Parts gp = Geos Sole Mn 2 (: = | . (7.69) 
m 


8mm? 81 


Using Gr from u decay and cose ~ 0.9742 from 8 decay, the observed lifetime yields 
fr = (130.50 + 0.14) MeV ~ 0.93m,-, where the uncertainty is dominated by higher- 
order effects (Patrignani, 2016). This is in remarkable agreement with the lattice QCD 
prediction (Rosner et al., 2015) 130.2 + 1.7 MeV. 

The rate for 7* + e™ve(De) is of the same form as (7.69) except m, — me. Therefore, 
one predicts 


T ae 2 2 m2? 
2.2 (7 > ev + evy) a me (3 me) (1+ O(a) 
P(r > py + vy) m2 \ m2 — me 


= 1.28 x 10-4 (1 + O(a)) = 1.2352(2) x 1074, 


(7.70) 


where O(a) is a radiative correction (Bryman et al., 2011). This is in perfect agreement with 
the value Ry = (1.234 + 0.003) x 1074 from the PIENU experiment at TRIUMF (Aguilar- 
Arevalo et al., 2015), and supports the universality of the e and p charged current interac- 
tions. Neutrino beams at accelerators are obtained mainly by the decays of m+. The small 
value of R, is the reason that such beams are mainly v, and v,,, with little Ve or De (some 
Ve are produced, e.g., from K or secondary p* decays, but these are mainly considered as 
backgrounds). 

The suppression of the 7-2 mode 7 — ev is a consequence and test of the V, A structure of 
the charged current. In particular, V — A is just the left-chiral projection 2 Pz. Since chirality 
and helicity are the same for a massless fermion, weak charged currents imply helicity 
h = —1/2 for e~,p~,ve, and Vu, up to corrections of O(m/E) in amplitude. Similarly, 
h = +1/2 for et, u”, De, and Du. Since the neutrino masses are negligible in this context, 
the D in m— — €~ i must have h = +1/2. But the pion has spin-0, so angular momentum 
in the rest frame requires h = +1/2 for the charged lepton as well. This is the wrong 
helicity, leading to the m2/ mi, factor in R,. Thus, the electron mode is strongly suppressed, 
despite the partial compensation from the larger phase space and from the trace. Similar 
conclusions would actually hold for any combination of V and A, which always lead to 
opposite helicities in the massless limit. Things would be different for a scalar-pseudoscalar 
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interaction M ~ ü (1+a7°) v2, which leads to equal 47 and 7 helicities for massless 
fermions. A pure S, P interaction would predict 


m= (4 


2 
mź =m 


) = 5.49, (7.71) 


ENO vw 


which is clearly excluded. The value of R, can also set limits on small S, P perturbations 
on the dominant V — A and to violations of universality. 


The Weak Currents and SU (3)z x SU (3)r Chiral Symmetry 


In (5.97) and (5.98) on page 189 we defined an octet of vector and axial generators of 
SU(3)z x SU(3)r and their associated Noether currents, i.e., Vi = Tuq and Ai, = 


naa à q. The hadronic weak charged currents are simply related to some of these, e.g., 


üy yd = Al, + iA, Ups = A, + iA}, (7.72) 
and similarly for the vector currents. Defining 
J, = Vi — Ap (7.73) 


the hadronic weak charged currents are 


IM = (J) + iJ?) cos 0, + (Ji + iJ2) sin 6, 


J? = (ght)! = (J} — iJ?) cos 6. + (J4 — iJ’) sind ae 
H H H H c H H ce 
Similarly, the hadronic electromagnetic current is 
2 ae 
Jo = yuyu zd Md 337s 
1 = 1 7 
z zu dy"d) 4 gu + dy'd — 257" 8) (7.75) 
1 
=V Ve 
V3 


These relations are useful because they allow us to use SU(3) and SU(3)z x SU(3)R to 
constrain and relate their matrix elements. In particular, in the SU(2) and SU(3) limits 
one can relate the form factors of the weak interaction vector currents to those measured 
in electromagnetic transitions, using (7.75). This all seems obvious in the quark model, 
but was inferred earlier and was known as the conserved vector current (CVC) hypothesis. 
Similarly, the axial currents and the pseudoscalar mesons in (3.91) both transform as SU(3) 
octets, so in the SU(3) limit one must have 


(ALO la) = Hagh, ij = 1-0-8, (7.76) 

where the v2 is inserted to coincide with the definition in (7.66) of the pion decay constant, !° 
7 2 1 1a A2 l 32 cp p 

(Oltyp,ysd|a~) = Vg +tAt |r — in") = +ifrq". (7.77) 


Recall that these relations motivated the partially conserved axial current (PCAC) hypoth- 
esis in (5.119) on page 193. Of course, SU(3) is broken by ~ 25%, so one expects that the 
values f, and fx actually measured in m and K decays may differ by up to this amount. 


10Some authors define fr without the v2 in (7.76), so that it takes the value ~ 92 MeV. 
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The Kez Decays 


The K= = uš5(sū) are pseudoscalar mesons with mg+ = 493.7 MeV and a typical weak 
lifetime of T+ = 1.24 x 1078 s. The neutral kaons K? = ds and K? = sd are somewhat 
heavier (497.6 MeV) because the Mma > Mu quark contribution to the splitting is larger than 
the electromagnetic one. The neutral K decays have special properties due to K? — K? 
mixing and CP violation, and will be described in Section 8.6. The branching ratios for the 
principal K+ decays are given in Table 7.2. The dominant mode is Ky,2, i.e., KT > W` Dp- 
The calculation is identical to 7,2. The hadronic matrix element is 


(OJJ (E)K (Q) = -isin befr et? , (7.78) 


where the kaon decay constant fx may differ from fy due to SU(3)-breaking. Again, only 
the axial part of J’“' contributes due to the reflection invariance of the strong interactions. 
The rate is 


u 2 
8m Mig 


2 
7 2 sin? 0 f2 m? 
Peete Cpein Vein MK y? (: | ; (7.79) 
The Cabibbo angle factor sin ĝe ~ 0.2253 is known independently from Ke3, 8, and hy- 
peron decays, allowing one to determine fx from the K lifetime and branching ratio, i.e., 
VK-y-v, = B(K~ > pd, )/TK-. The result is fx = (155.7 + 0.5) MeV ~ 1.19fr, to be 
compared with the lattice QCD result 155.6 + 0.4 MeV (Rosner et al., 2015). 


The m3 and Kes Decays 


The m3 and Kgg decays are the three-body semi-leptonic decays, t+ > n?etve (pion beta 


decay), Kt > n°étyp, and Kz, s > n+l (De), and their CP conjugates, where £ = p 
or e. Kz s refer to the mass eigenstate neutral kaons ~ (K? + K°)/,/2 (Section 8.6). 


The matrix element for m+ > nlet ve is 


M = -i FE ty = Wwe WIT Ole) (7.80) 


where the relevant part of J’ is the first term in (7.74). It is straightforward to show (Prob- 
lem 7.9) that only the vector current contributes due to reflection invariance. Furthermore, 
by Lorentz invariance, the matrix element must be of the form 


(T° (p) |J” (x)|n* (p)) = (2° (p) |V (x) — iV" (x) |r" (p)) cos 8e 
= [FP +p) + f-@)(' — p)"] o 


where f+ are form factors analogous to the electromagnetic matrix element in Section 2.12.4. 
Similarly, the matrix element for K+ > nlt is 


(7.81) 


M= i diy yy(1 — 7) ve (n° (p") J (0) K+ (p)), (7.82) 


where 


(1° (p')| J"! (a) |K* (p)) = (n(p)|V4¥ (a) — iV*™ (a) (p)) sin Be (7.83) 
= EP +p)” + FE — p)"] sin gei? ZP), : 


Time reversal invariance implies that f} and f_ are relatively real, as are f£. 
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It might seem difficult to say anything, given the existence of two form factors factors 
for each process. However, in the SU(2) or SU(3) symmetry limits one can say a great 
deal. That is because the V;! are the Noether currents associated with the SU(3) generators 
F". In particular, they are conserved, 0“V,; = 0, implying f_(q*) = 0. Furthermore, the 
pseudoscalar fields transform as an SU(3) octet, i.e., 


[F’, 7] = (Laas) jr a =a ign i,j,k = Teg, (7.84) 

using (3.16) on page 93. This implies that 
Fij) = | BEVI) = ifia”), (7.85) 
since F"|0) = 0 and since F"|7) should not contain states outside of the octet (this argument 
will be made more precise in Section 7.2.4). From the general form analogous to (7.81) or 


(7.83) this implies 


(nE (P| VELTI (p)) = ifi fP + p) pel? E, (7.86) 


with f(0) = 1. For pion 8 decay the q? dependence can be ignored since m2 < q? < 
(m,+ — Mp0)? < m2. Therefore, 


Vi —iV2|nt) = ae mV — iV 2 |a! + in? 
u u v2 u u 
. 7.87 
== 1 > s 1 _ V2 1 ( ) 
E [ie123 — t€213] (P' + p) = —V2(p' +P), 


i.e., the prediction of CVC is f,(q?) ~ f+(0) ~ —V2. (See the comment on the 7+ phase 
convention in Problem 3.12.) 


Pion Beta Decay 
From (7.80) and (7.87), the matrix element for pion beta decay is 
M = iGr cos be (p +p’), ty" (1—7°) ve. (7.88) 


A good first approximation to the rate is obtained by neglecting me and working to leading 
order in 


A=m,+ — Mpo ~ 4.6 MeV . (7.89) 

Since A is much smaller than the t+ mass, the 7° is produced nearly at rest and the 
3-momentum po = —pe — p, is very small. Therefore, 

M ~ 2iGp cos O.mztyy? (1 — 7°) ve. (7.90) 


The leptonic part can be evaluated by writing out the Dirac spinors explicitly, most conve- 
niently in the chiral basis using (2.195) on page 40, 


ty? (1— 7°) ve = 2up(—)ve(+) 


. . bev (7.91) 
= -4V E.E, $- (Bv)' X+ Be) = —4V EE, cos, 


244 The Standard Model and Beyond 
where in the massless limit only the +(—) helicity et (ve) is produced. We have used the 
explicit helicity spinors from Table 2.1, and 6¢, is the angle between pe and py. Therefore, 

||? = 32G% cos” 0em2 Ee Ey (1+ cos Oey) , (7.92) 


which could also have been obtained using standard trace techniques. 
The decay rate is 

1 PPro dP, Ape 

16(27)5 Mr E,o E, Ee 


dl = 6t (Dat — Pro — Du — De) |MI?. (7.93) 
We will evaluate the phase space integral directly, though the more powerful techniques in 
Appendix D are more useful in the general case in which A and Mme are not small. One can 
use the ô function to do the p,0 integral, and approximate 


Exo = y (By + Be)? + m2. ~ Mao, (7.94) 


as was already assumed in (7.90). The pe and p, integrals are therefore unconstrained except 
for an energy-conserving 6(A—E,,—E,) factor. The cos ĝe, term integrates to 0 in the overall 
rate. Therefore, 


1 Pp, pe 


T= d(A— E, — Ee) |M}? 
16(27) m2 E, Ee ( ) |M] 
2 2 
= es J E?dE, E2dE, ô (A — E, — Ee) (7.95) 
T 
O GR cos? beng f! ‘ 2 Gf cos? 0A" 


Including the leading corrections (see (Sirlin, 1978) and Problem 7.7), one predicts 


2 29. A 2 
GF COs 3 ome + radiative +--+- | ~ 0.399(1) s7+, (7.96) 
2 My A? 


T ~N 
3073 


Ae h 


in excellent agreement with the experimental value 0.3980(23) s~! by the PIBETA collab- 
oration at PSI (Pocanic et al., 2004). 


Ke3 Decays 


In addition to pion beta decay, tt > n?etve, the Keg decays Kt > n°l*1,~ and Kr s > 


m+ €*ve(de) are very important as tests of CVC and especially as a measurement of sin be. 
In the SU(3) limit the values of f,(t) at t = q? = 0 are predicted from (7.86) to be 


1 = = 
froma, e oea e (797) 


NA 


The ror prediction follows from SU(2), while the others require SU(3). However, the 
relative values for K+ and K° follow from isospin, and the relation between K° and K? 
follows from the charge conjugation invariance of the strong interactions. Since isospin 
is a good symmetry at the percent level, one expects the 7+ — 7° prediction to be quite 
reliable. However, one might expect a large deviation of the SU (3) result for f°", perhaps 
as large as the fg/fr ~ 1.19 we found for the axial matrix elements from Ky¢2/7¢2. In fact, 
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the relation of the vector currents to the generators leads to the Ademollo-Gatto theorem, 
which states that the deviations of the f,(0) due to SU(2) and SU(3) breaking are actually 
of second order, and should therefore be small. We will derive the Ademollo-Gatto theorem 
in the next section, but first will consider the Ke3 decays in more detail. 

The Kgg decay rate is (Leutwyler and Roos, 1984; Antonelli et al., 2010a; Cirigliano 
et al., 2012; Patrignani, 2016) 


Gb s 276 
where 
1 (mK—mx)? 2 2\ 2 
Ih = — | dt \3/2(m2.,m2,t) (1 i =) ( = =) F(t). (7.99) 
Mig Jm? 2t t 


AÀ is the kinematic function defined in (2.39) on page 14 and 


E — m2)? ong 
P(e) = Fue? + Ee fey (7.100) 
with i : 
Ros t, AwW=RAW+——  ™. (7.101) 


f+) mz — ms f+(0) 

There are also radiative correction factors that we do not display. A special case of (7.98) is 
derived in Problem 7.8. A complication in the Ky3 decays is that t is not necessarily small, 
so the t dependence of f+ must be included. Fortunately, this can be directly measured, e.g., 
by the pion energy distribution. Even though the symmetry breaking effects in f+(0) are 
second order, they must still be estimated by chiral perturbation theory or lattice techniques 
and taken into account. Finally, the symmetry breaking may also induce an f_ form factor. 
This leads to a contribution proportional to me in the amplitude, which is non-negligible 
for K„3. Despite these complications the Ky3 system has been carefully studied, because it 
leads to the most precise determination of sin ĝe (or its generalization to the CKM element 
Vus in the three-family case.) 


7.2.4 Nonrenormalization of Charge and the Ademollo-Gatto Theorem 


Consider the question of whether electric charge is renormalized by the strong interactions!! 
for the example of the pion. As discussed in Section 2.12.4, the matrix element of the 
electromagnetic current is 


(17 (p')|J6(x)|a* (p)) = [e + p)" F2 (a?) + (p! — p)* f2(q?)| et, (7.102) 
where q = p’ — p. The electric charge, in units of e, is f2(0), so the nonrenormalization 


requires fe (0) = 1 in the presence of the strong interactions. 
To see how this comes about, recall that the U(1)g generator is 


Q(t) = / PEJSE, 2), (7.103) 


11The gauge coupling e is renormalized, and may depend on the scale, but that effect is universal for 
all charged particles. See (D’Alfaro et al., 1973) for an extension of this argument to include higher-order 
electromagnetic effects. Higher-order electromagnetic effects also contribute a small q? dependence to f+. 
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that current conservation WIG = 0 implies ae = 0, and that the transformations of the 
charged pion fields, which can be viewed either as elementary or as composite as in (5.102) 


on page 190, are 


[Q,0*] = F7. (7.104) 


All of these statements are obviously true for free fields, but continue to hold to all orders 
in the strong interactions since they are U(1)g invariant, i.e., [Q, H] = 0. Taking the 


divergence of (7.102), current conservation immediately implies q? f? (q?) = 0, and therefore 


JE) = 0 since strong interaction effects are not expected to produce a delta function. 
The nonrenormalization can be established by taking the matrix element of (7.104) between 
a single pion state and the vacuum: 


B 
(oI 1) = f gam C OR ON O W 
+ E DIQ) (arIo) = +(x). 


nent 


(7.105) 


We have assumed that U(1)g is not spontaneously broken, i.e., Q|0) = 0. The last 5°, term 
is a sum or integral over all single and multi-particle states other than the 7*. But, 


(x* (p)|Q|x* (k)) = [ze we [ip + KLR) + (p= k)° JE (g?) 
= (27)? (P — k)2Ep fS (0), (7.106) 
so the first term in (7.105) is fÊ (0) (r+ (p)|r7]0}. Similarly, 


(rt (p)|Q\n) = (27)? (8 — pa) (wt [JQ (0) |n) e FE. (7.107) 


Since dQ/dt = 0, only (positively charged) states with E, = En and p= pn, and therefore 
Mz = Mnp, can have nonzero matrix elements. (This can also be seen from (3.52) on page 99.) 
There are no such states, so the last term in (7.105) vanishes and ie (0) = 1. We note that 
the strong interactions will, however, introduce a strong q? dependence to fÈ, and also 
induce a magnetic form factor for nucleon matrix elements. These effects can be taken from 
experiment. 

Exactly the same reasoning can be used to establish the nonrenormalization of the SU (2) 
and SU (3) generators in the symmetry limit, as expressed in (7.85) and (7.86), by taking 
the matrix element of (7.84) between (z*| and |0). The electric charge generator is a special 
case. 

Now let us turn on SU(3) breaking and establish that the deviations of the Ke3 form 
factors f/*(0) from the predictions in (7.97) are of second order. This is most easily seen 
from the commutator relation 


[Ft + iF", F*—iF?] = F’ + V3F8 = I + y. (7.108) 


This is expected to hold even in the presence of SU (3) breaking mass and interaction terms, 
as discussed in Section 3.2.2. [3 and y are, respectively, the unbroken third component of 
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isospin and strong hypercharge, defined in (3.89) on page 104. Taking the matrix element, 


(a+ (p's + Sl (p)) = (2n)°2E, 5H" — P) 
= (xt (p')| [F4 + iF", F4 — iF*] |x*(p)) 


=| Eat (pE + iF IRIE) ROK) — iF ln ()) 


(27)32E;, 
+ So (nt | Ft + iF? |n)(n|F4 — iF? |rt) 
n£#K° 
— So (at |F4 — iF? In) (n| F4 + iF? rt). (7.109) 


n 


The K? term can be evaluated using the analog of (7.83), but with oor, It is a Lorentz 
invariant, and is conveniently evaluated in the |p| — co frame, to obtain 


(1E (0), (2n)*2B,6°(p" P). (7.110) 


The remaining terms, involving the leakage out of the pseudoscalar octet, are of second order 
in SU(3) breaking, as is apparent from (3.52). Since f ~*" (0) + —1 in the symmetry 
limit, we obtain the final result 


OP (0) = —1 Oe), (7.111) 


where e2 is the SU(3) breaking parameter defined in (3.107) on page 108. This is the 
Ademollo-Gatto theorem (Ademollo and Gatto, 1964; Behrends and Sirlin, 1960; Fubini 
and Furlan, 1965; Marshak et al., 1969; D’Alfaro et al., 1973), which implies that SU(3) 
is fairly reliable for f# (0). Similarly, deviations from the SU(2) prediction in (7.87) are 
expected to be second order in isospin breaking, and therefore very small. Similar results 
hold for the vector (but not axial vector) form factors in nucleon and hyperon 8 decay. 

One can extend these considerations to chiral SU (3)z x SU (3)r. In the chiral limit the 
pseudoscalars become massless and deviations from the symmetry predictions are sometimes 
dominated by calculable non-analytic terms generated by infrared singularities (Pagels, 
1975). In the case of the f;(0), the leading corrections to the chiral limit are of order 
€2/e9 (Langacker and Pagels, 1973), where €o represents the explicit chiral breaking, such 
as the common quark mass term in (5.91) on page 188. This is of first order in chiral SU(3) 
but second order in ordinary SU (3), and leads to a 2% reduction in the prediction for f/* (0). 
More precise estimates from chiral perturbation theory and lattice techniques are surveyed 
in the reviews cited in the Kgg section. 


7.2.5 6 Decay 


p decay refers both to free neutron decay and to nuclear beta decay. These and closely related 
processes are listed in Table 7.3. The amplitude for the free neutron decay corresponding 
to Figure 7.1 is 


M= a ten (1—77) w (pl wy" (1 — 7°) d |n) cos Ac. (7.112) 


v2 
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TABLE 7.3 £ decay and related processes.® 


n —> pe” Ve Neutron decay 

(N,Z) > (N-1,74+l)e"R Nuclear (heavy nuclei, e.g., in reactor) 
(N,Z) > (N +1,Z — 1)etve Nuclear (light nuclei, e.g., in Sun) 

€ p —> ven Electron capture (atomic electron) 
UTP —> Vyn u capture (muonic atom) 

Ven —> ep Inverse 8 decay 

Dep & etn 


"N and Z are the numbers of neutrons and protons in a nucleus. All of these processes are driven by the 
cos 0c part of Hs, in (7.20). 


The Vector and Axial Form Factors 


We have seen in Chapter 2 that the matrix element of the vector current is restricted by 
Lorentz and translation invariance, and by the reflection invariance of the strong interac- 
tions, to be of the form 


iot” 


(pltiy"d(x)|n) = üp |7" Fı (4°) + qu Fola?) +q F3(7?)| unet”, (7.113) 


2m 


where q = pp— Pn and we have used the Gordon decomposition in Problem 2.10 to eliminate 
possible üp (ph + ph )un and Upo"” (ppv + Pnv)Un terms. Time reversal invariance implies that 
Fi 2,3 are relatively real (Appendix G). We also have that the hadronic vector current is 
uy'd = V!” + iV7", where V! are the Noether currents related to the isospin generators. 
Equivalently, they are in the same isomultiplet as the isovector part V3 of the electromag- 
netic current, Jọ = V? + av’ (CVC). The matrix elements are related by the SU(2) 
relation (a|V*|b) x 7,/2, where a,b = p or n and i = 1,2,3. The isovector part of Jọ can 
be separated from the isoscalar V8 by considering the difference between the corresponding 
proton and neutron form factors, 


II 


F (0) 
F(0) 


FY (0) = FP(0)— F? (0) =1 


a (7.114) 
Fy (0) = Kp — fn = 1.79 + 1.91 ~ 3.70, 


2 
2 


where FY’ (0)(F{'(0)) = 1(0) is the proton (neutron) electric charge, Kp, are their measured 
anomalous magnetic moments, and F Y 9 are the isovector form factors defined in Section 
2.12.4. The F2(0) effect, known as weak magnetism, has been observed in the e* spectra 
in 2B >" Ce-p, and 1? N >!*Cet1,, in agreement with the CVC prediction. CVC also 
predicts that F(q?) = 0, since OHV = 0. (This also follows from G-parity.) However, 
its contribution would be proportional to me and small even if it were present. For most 
processes q? is small enough that its effects can be ignored. 

The vector current form factors are therefore under control for neutron decay. In fact, 
by the Ademollo-Gatto theorem, corrections are of second order in isospin breaking and 
therefore tiny, i.e., (ma — Mu)”, Za”, and Za(mg — Mu), where we have included a Z for 
the case of nuclear decay. 

The axial matrix element is more difficult, because the axial generators of SU (2)z x 
SU(2)p are spontaneously broken. The most general form consistent with P, Lorentz, and 
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translation invariance is 


(play"'d(x)|n) = (p|A™ (a) + iA% (x)]n) (7.115) 
igh? 
= tip |g) + avg") + 9798 (4°) | une’, 

where we have again used the Gordon decomposition. Time reversal requires that g1, 92, 
and g3 are relatively real,!? and go is required to vanish by G-parity (Appendix G). As 
mentioned following (5.112) on page 192, the induced pseudoscalar form factor g3 is present 
because of the chiral symmetry breaking, which generates a Goldstone boson pole. However, 
the effect of this term is usually negligible!’ because it generates a contribution proportional 
to me. gi(0) = 1.272(2) is measured in the rates and various parity-violating asymmetries in 
neutron and nuclear (6 decay. Unlike the vector form factor, there is no nonrenormalization 
theorem to prevent gi(0) from being changed from its pointlike value of 1, even in the chiral 
limit. The proof of nonrenormalization for the vector generators F relied on the fact that 
F"'|a), where a is a single particle state such as a pion or nucleon, can only have a nonzero 
matrix element with another state with the same invariant mass as a, and there were no 
such states with the necessary quantum numbers. For the axial generators, however, the 
pions become massless Goldstone bosons in the symmetry limit, so one can have nonzero 
matrix elements, e.g., (tN|F*|N). 

To understand this better, we outline the Adler- Weisberger relation (Adler, 1965a; Weis- 
berger, 1965). Recall that the commutators of the axial SU(2) generators are 


[PP FI] Steg F", i,j,k = 1,2,3. (7.116) 


Just as we did for the Ademollo-Gatto theorem, take the matrix element of (7.116) between 
single particle states. In this case, we will use protons and ignore isospin breaking. Then, 


1 ee 
5 do boas: (2, $2) 2F* |p(Pr, $1) = (21)°5° (P> — Pr) 2B 
81,82 


i (7.117) 
= 5 3 deas: (P(P2, 92)| [FY + iF”, FS — iF”] [p(p1, 1). 


81,52 


Now, insert a complete set of intermediate states between the generators. The single neutron 
contribution I» is readily evaluated using (7.115): 


(27)? 6°(P2 — Bi) 
2 2E: 
M2 
= (27) (B = 2E llO (1- Fe) 
i 
> (20)°0° (p2 — P1)2F1\91(0)|?, 
where the last form is valid in the |p| —> oo frame. The remainder of the sum is 
1 
Inga = 5 So Brea S| OLE + iF mn) (nl P'S — iF p) 
ae (7.119) 
— (p| FY —iF? |m) (m| F5 + iF py}. 


i |91(0)|? Tr [077 (1 + Mn) y0? (#1 + Ma)] 


(7.118) 


1299 would have to be imaginary for a Hermitian current such as iiy7y>u, so it would have to vanish if 
time reversal holds. The same is true for the g2 in the matrix element of the electromagnetic current J6 
(where it corresponds to an intrinsic electric dipole moment), even if parity were violated. 

131t has been measured in u capture (e.g., Gorringe and Hertzog, 2015). 
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The matrix elements in (7.119) can be related to matrix elements of the 7* field using 
(3.52) and (5.119), which in turn are proportional to the strong interaction amplitudes for 
Tp — m, to obtain the Adler-Weisberger formula 


Qf? fe wdw 
2=1 A) i = ; 
gı (0)| an Te IE iby 2 M2 [on p(w) Or p(w)| , (7 120) 


where 0,+)(w) is the total cross section for 7p scattering (with a massless pion) at CM 
energy w. Using the observed cross sections (and including corrections for the off-shell 7), 
(7.120) yields g1(0) ~ 1.25(2) (e.g., Beane and Klco, 2016), close to the experimental value 
~ 1.27. This agreement was influential in establishing current algebra, and as we have seen 
relies not only on the Fermi theory but also on our understanding of chiral SU (2) z x SU(2)r 
as an approximate spontaneously broken global symmetry of the strong interactions. 


Neutron and Nuclear Decay 


To a good approximation, the hadronic matrix element for neutron decay is therefore 


(pl JR n) = tip (gv — 947") Un, (7.121) 


where 
gv = fı (0) cos ĝe ~ cos be, ga = gı (0) cos 0e. (7.122) 


It is then straightforward to work out the neutron lifetime and various decay distributions 
(Problem 7.10). The total lifetime is 


G2 a 
mt = ra= FE (lov + Sigal) f peBe (A Be)? dE (7.123) 


Me 


where A = Mn — Mp ~ 1.29 MeV. The integral,'* obtained by a calculation similar to the 
one in (7.95) for pion beta decay, is most easily done numerically, yielding 1.64m° for the 
observed value of A/m,. From the observed lifetime ~ 880.2(1.0) s (Wietfeldt and Greene, 
2011; Patrignani, 2016) one can obtain g?, + 39%. The ratio A = ga/gyv ~ 1.2723(23) can 
be extracted from various asymmetries. For example, for a polarized neutron, the angular 
distribution of the e~ w.r.t. the neutron spin direction is 


dI A? =X 
1 — 2be = í 
dcos0. i (7) ey sedan) 


where cos ĝe = pe: ên and 8. = |pe|/Ee. From gy and g4 one can determine g1(0) (actually 
gi(0)/f1(0)) and cos ĝe. Similarly, the polarization of the produced electron is just 


_ The =+$) —T(he = -§) _ 
n= eer e A (7.125) 


which approaches —1 as expected for Mme —> 0. 

For nuclear 8 decay the nuclear matrix element, either (N — 1, Z + 1J2"(x)|.N, Z) or 
(N+1,Z—1|J}(x)|N, Z), must be determined, from theory or by relating to other measured 
matrix elements. Transitions involving vi are known as Fermi transitions. Especially impor- 
tant are the superallowed transitions between 07 states in the same isomultiplet (Hardy and 


14This is known as the Fermi integral. A more detailed treatment includes a factor F(Z, Ee) that corrects 
for the distortion of the e+ wave function by the Coulomb field of the nucleus and atomic electrons. 
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Towner, 2015). These are pure Fermi transitions, with the relevant form factors determined 
by the isospin generators. Hence, there is little theoretical uncertainty except for second 
order isospin violation (the Ademollo-Gatto theorem). The superallowed transitions yield 
the cleanest determinations of cos ĝe (i.e., the element Via of the quark mixing matrix in the 
full SM). The axial currents lead to Gamow-Teller transitions, such as those with AJ = +1. 
Mixed transitions, such as neutron decay, involve both. There has been an enormous amount 
of experimental and theoretical work on nuclear and neutron 8 decay. The formalism has 
been developed for arbitrary admixtures of S,P,T,V, and A interactions (Jackson et al., 
1957). Many different types of asymmetries, polarizations, and correlations have been mea- 
sured or searched for. These have established the basic V — A nature of the interactions; 
tested the C, P, T, and G-parity properties; searched for perturbations on V — A from such 
types of new physics as extended gauge groups involving V + A couplings, leptoquark in- 
teractions, and mixing between the ordinary neutrinos or other fermions with heavy exotic 
states; tested the radiative corrections, which require the full standard model; extracted the 
form factors; and determined V,,q for universality tests. Detailed accounts may be found 
in (Commins and Bucksbaum, 1983; Renton, 1990; Deutsch and Quin in Langacker, 1995; 
Herczeg, 2001). Recent reviews include (Severijns et al., 2006; Abele, 2008; Dubbers and 
Schmidt, 2011; Bhattacharya et al., 2012; Cirigliano et al., 2013; Vos et al., 2015). 


7.2.6 Hyperon Decays 


There have been extensive studies of the semi-leptonic hyperon decays. These include the 
AS = 0 decays n + pe“, EE > Aetve(De), U7 4 Ele De, and Z7 — Bey, as well 
as the AS = AQ = 1 decays A > pl De, ET > nl De, B7 > Al“, E7 4 Dl HH, and 
=° = E+- p, where £ =e or p. 

In the SU(3) limit the hyperons transform as an octet, as shown in (3.90) on page 105 
and Figure 3.1. Since the hadronic weak currents also transform as octets, one can use 
SU(3) symmetry to related the matrix elements. For an arbitrary SU(3) octet of operators 
OÏ j =1---8, one has that 


(Br|O"|B;) = ifizxOr + dijkOp, (7.126) 


where the f and d coefficients are defined in Table 3.2, and Or p are two reduced matrix 
elements that are independent of i, j, and k. This is the analog of the Wigner-Eckart theorem 
for SU(2) and has already been used in (3.102) in the derivation of the Gell-Mann-Okubo 
formula. The hadronic vector currents are associated with the SU(3) generators, so only 
the fijk term is non-vanishing, i.e., 


(Bx|V;i(0)|.B;) = ifizetk ut fil), (7.127) 


with q = px — p; and fı(0) = 1, exactly analogous to (7.86). The Ademollo -Gatto theorem 
applies to the hyperons as well as the pseudoscalars, so one expects the predictions to be 
quite accurate, fı(0) = 1 + O(e2), where the coefficient of the correction can depend on 
i, j, k. One can also include weak magnetism corrections to (7.127), which are also related by 
SU (3). The predictions for the vector and axial form factors are summarized in Table 7.4. 

The axial current matrix elements can have both f and d type contributions. It is 
conventional to define the coefficients F and D by 


(B| Ai, (0)|B;) = [ifijeF (a?) + dijnD(q?)] ükyu uj, (7.128) 


where F = F(0) and D = D(0). The corrections are first order in SU (3) breaking, so the 
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TABLE 7.4 SU(3) predictions for the values of the vector and axial form factors at q? = 0. 


Decay Vector Axial 

AS =0 n— pe De 1 F+D 
Dt > Aet ve(De) 0 [3D 
ET > Le Ve V2 V2F 
EO — Be a, 1 F-D 

AS =1 A > pl ig -,/3 — 3(F +2) 
D- > ni =i -(F — D) 
Z- > Aie 3 3(F— 2) 
z7 + DA Fw V2(F + D) 
2 > EH 1 F+D 


predictions are less reliable than for the vector currents. F and D and the q? dependence 
must be taken from experiment. One combination is obtained from the g;(0) from 8 decay 
in (7.115), 
1 1 
— 5 — . 1 i. . 

(pltypy dln) = (75 (Bs + iB7)|A,, + Al g +iBs)), (7.129) 
yielding F + D = g,(0). There have been extensive studies of the decay rates, the £ energy 
spectrum, correlation between the pe and the initial hyperon polarization, the ?—v angular 
correlations, and the final baryon polarization, both to test the Fermi theory and to obtain 
an accurate value for sin 0e. The data agree quite well with the theoretical expectations, 
and yield 


F =0.462(11), D =0.808(6), Vas (~ sin 8e) = 0.226(5) (7.130) 


from an overall fit (Gaillard and Sauvage, 1984; Cabibbo et al., 2003; Mateu and Pich, 
2005). 


7.2.7 Heavy Quark and Lepton Decays 


The r+ lepton, with m, ~ 1.777 GeV and lifetime ~ 2.9 x 10713 s, was the first member 
of the third family of quarks and leptons to be discovered. It is similar to the electron and 
muon except for its large mass, m,/m, ~ 17 and Mmr /Me ~ 3500. It can therefore undergo 
both leptonic decays, 77 + v,l~i%, L = e or p, and semi-hadronic (also referred to as 
hadronic) decays, rT > v;dii or v,st, with the quarks emerging as mesons such as m~ or 
n°. The T decays have been very useful in testing the Fermi theory (and SM) couplings, 
extracting the QCD coupling at the 7 scale, and searching for new physics (see, e.g., Davier 
et al., 2006; Pich, 2014; Patrignani, 2016). 

The leptonic decays are very similar to u decay, with branching ratios ~ 18% each for the 
e and u channels, consistent with the simple parton model expectation that the branching 
ratios for the e, u, and gg decays should be in the ratio 1, 1,3 (because of the 3 quark colors). 
They have been extensively studied (Patrignani, 2016), to establish the V — A nature of 
the 7 current and to verify weak universality, i.e., that the strength of the interaction is the 
same as the one for muon decay, as is expected in the SM extension of the Fermi theory. For 
example, the Michel p parameter, analogous to the Michel parameter for u decay in (7.55), 
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is measured to be 0.745(8), consistent with V — A value of 3/4. Similarly (Pich, 2014), 


eG. GZs G2 
ate . ae =]. 2T on 992 131 
G 1 002(3), G 006(3), G 0.99 (5), (7 3 ) 


2 = 2 2 = 

F F F 
where, e.g., Gre is the coefficient of the four-fermi interaction measured for T~ > vre~ De. 
The values in (7.131) are in reasonable agreement with universality. 

The semi-hadronic decays T~ —> hv, include h = m~ (~ 11%), m77? (~ 26%), and 
TT +n7?(n < 4), as well as modes involving 3 or 5 charged particles and modes involving 
kaons (the latter are suppressed by sin? ĝe and by phase space). The t~ + m~ v, decay rate 
can be predicted using fr extracted from 7 — uv (Equation 7.66), leading to the effective 
coupling quoted in (7.131). The semi-hadronic decays can also be used to determine the 
strong coupling a,(m2), the CKM matrix element V,,, ~ sin ĝe, and were historically useful 
for setting a direct upper bound ~ 18 MeV on the v, mass in modes involving 3 or 5 charged 
pions (for which little phase space is available). 

There have also been extensive searches for rare decays, such as T —> ¢y, which are 
forbidden in the Fermi theory and standard model up to negligibly small neutrino mass 
effects, but could be generated by new physics (Section 8.6.6). 

In the four-quark model, the c decays via the hadronic weak current 


Ji = 5y (1 — y°)e cos be — dyp (1 — 75 )c sin be + u terms (7.132) 


in (7.23), i.e., to s and d with relative strengths cos ĝe and sin 0e. In the 3-family extension, 
the c decays are little affected because the mixing between the third family and the first 
two is small. However, these small effects allow b quark decays via 


IB = ty (1—7°)b Van + Oy (L— P)bVea ts (e 


where V,,, and Ve are the relevant elements of the CKM matrix in (7.24). Empirically 
[Vao] < |Veo] < |Vus| ~ sin ĝe. The small values for |Vus| and |Ve| lead to longer lifetimes 
for the b-flavored hadrons than would be naively expected. 

Measurements of two-body leptonic decays Dt = cd > ptv,, DE = c5 > utvp, 
Di => rtv, and B~ + T~ p, have been made by BaBar, Belle, BES, and CLEO, allowing 
extraction of the decay constants (analogous to fr and fx). The results are in reasonable 
agreement with the theoretical expectations, mainly based on lattice calculations (Rosner 
et al., 2015), although there are some < 2c tensions. 

The c quark pole mass, at ~ 1.6—1.7 GeV (Equation 5.124), is marginally large enough to 
attempt to use use simple parton model (SPM) ideas to describe its other decays, especially 
for the semi-leptonic modes. In the spectator model one ignores the hadronic aspects of 
the initial charmed (or b-flavored) hadron and simply considers the decay of a free quark, 
treating the light quarks or antiquarks as irrelevant spectators. For example, one would 
expect branching ratios for c > (s or d)etve ~ 20% by naive counting arguments, compared 
to the observed 16% for Dt decays. The Dt lifetime ~ 1.04x 1071? s is also close to the naive 
free quark estimate of (m,,/m_)°7,,/5 ~ 6 x 10713 s (obtained from scaling the y lifetime, 
obtained for me ~ 1.6 GeV and neglecting the final hadron masses). However, the spectator 
model is less successful for other charmed hadrons. For example, the D? = cū meson would 
have the same lifetime and semi-leptonic branching ratio, whereas experimentally B(D° > 
et + X) is only ~ 6.5%. (The lifetime, 4.1 x 1071}? s, is actually closer to the naive scaling 
estimate.) The charm mass scale is clearly too low for the SPM to be reliable, especially for 
the non-leptonic modes, and there must be important contributions to the decay amplitudes 
in addition to the simple c-quark decay diagram. More detailed descriptions employ various 
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phenomenological and non-perturbative QCD techniques (e.g., Artuso et al., 2008b; Ryd and 
Petrov, 2012; Butler et al., 2013). For example, it is convenient to classify the contributions 
in terms of topological diagrams (Chau, 1983), as shown in Figure 7.8. It is also useful to 
distinguish between Cabibbo favored decays like c + sud (no powers of sin ĝe in amplitude); 
Cabibbo suppressed, like c — dud or c => sus (one power); and doubly suppressed, like 
c — dus (two powers). SU(3) and factorization assumptions are often employed as well. 


q q d,s q d,s q q q d,s q q q 


Si \ i rd | \ j | q i T 
a PA j | d | Á] # 7 
\ tj V y 
Iw \ Á 
Y 4 1 4 fi hS 
c q c q c q i a 


Figure 7.8 Tree-level topological diagrams for D decays, written in terms of an in- 
termediate vector boson W. Gluons may be attached in all possible ways. The first 
(spectator) diagram represents free quark decay, the second (internal emission) di- 
agram has a reduced color factor (analogous to (5.15)), the third is the exchange 
diagram, and the last is the annihilation diagram. There are additional one-loop 
(penguin) diagrams (Section 8.6). 


The SPM estimates for the decay of the b quark are expected to be more reliable because 
of its larger pole mass of 4.8 — 5.0 GeV. For example, the B+ = bu semi-leptonic branching 
ratio into (trp + X of 10.99(28)% (this is the average of etve and u*v) agrees with the 
naive SPM expectation of 1/9, based on possible decays into three lepton and two quark 
families. The B° = bd + ¿tv + X branching ratio is similar (10.33(28)%), and the B+ and 
B® lifetimes agree to within 7%. 

One can use the semi-leptonic lifetimes to determine Vo, either using the inclusive decays 
(summed over all charmed hadronic final states), with appropriate QCD corrections and 
quark masses (Benson et al., 2003), or the complementary exclusive measurements of specific 
decays such as B > Dem or B > D* li. The latter require measurements of the shapes 
of form factors and HQET and lattice calculations of their normalization (see the review 
of Va and Vu» in Patrignani, 2016). The results, which are only marginally consistent, are 
|V| = 0.0422(8) (inclusive) and |V,,| = 0.0392(7) (exclusive), with a combined value 


|Veo] = 0.0405(15), [Va] = 0.00409(39). (7.134) 


The uncertainty has been increased because of the discrepancy, using the PDG pro- 
cedure (Patrignani, 2016). This small value corresponds to the relatively long lifetime, 
Tg+ ~ 1.6 x 1071? s, comparable to Tp+ despite the much larger B mass, determined 
by the displaced vertex for the decay ~ 0.5 mm from the production vertex. The much 
rarer b — u transitions can also be studied by the inclusive semi-leptonic decays (ex- 
ploiting the higher energy leptons that are not kinematically allowed in b > ci), or in 
the exclusive B + li, decays (with similar form factor complications as in the charm 
modes). There is again tension in the two methods, with |V,,.,| = 0.00449(23) (inclusive) 
and |V,,»| = 0.00372(19) (exclusive), yielding the value for |V,,»| in (7.134), again with an 
increased uncertainty. 
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B decays are reviewed in (Artuso et al., 2009; Butler et al., 2013), while the charmonium 
(c@) and bottomonium (bb) systems are described in (Eichten et al., 2008; Brambilla et al., 
2011; Patrignani et al., 2013). The K° — K°, D° — D}, and B° — B° systems exhibit or are 
predicted to have interesting (second order weak) mixing and CP violation effects that will 
be described along with other aspects of CP violation in Section 8.6. 

The t quark, with pole mass ~ 173 GeV, is much heavier than the WF mass of ~ 80 
GeV (Chapter 8). It is therefore predicted in the SM to decay directly to (Wt (or much 
more rarely into sW* or dW*) with the W on shell (Problem 8.15). Because it is so heavy, 
the t decays without hadronizing. Its production and decays are excellent probes of QCD, 
the electroweak theory, and Vi, (Déliot et al., 2014; Kréninger et al., 2015; Boos et al., 
2015, and Chapter 8). The top couples strongly to the Higgs in the standard model, and 
its mass is critical to the issue of vacuum stability and to the Higgs mass prediction in 
supersymmetry (Section 8.5). It also plays an important role in many alternative models of 
spontaneous symmetry breaking and other types of BSM physics (e.g., Atwood et al., 2001; 
Zhang and Willenbrock, 2011; Boos et al., 2015). 


Lepton Energy Distributions 


We conclude this section with a comment on lepton energy distributions. In the four-fermi 
V —A theory, neglecting the final fermion masses, strong interaction effects, etc., the energy 
distribution of each final fermion i is either 2e?(3 — 2e;) or 12e? (1 — €;), where e; = 2E;/M 
and M is the mass of the decaying particle. The first (second) distribution holds for the 
e` (De) in uT — € VpDe, as is seen in (7.50) and Problem 7.2. The difference is due to the 
sign of the last term in (7.38), which in turn is associated with the relative V — A or V + A 
character of the currents (see (2.175) on page 37). In particular, the relevant product of 
currents for b > ce~ De is 

Gy (1 — 7°)b ey (1 — 7 ve. (7.135) 


The b and e” fields are related the same as for u decay in (7.32), so the e~ energy distribution 
will be of the 2e?(3 — 2e) type. For c + setve, on the other hand, the currents are 


Sya (1 — 7°)e Dey” (1 — 7 )e = —5y,(1— 7 )e eye (1+ ys, (7.136) 


where we have used the charge conjugation identity (2.302) on page 56. The positron current 
is therefore V+A. This will give the other sign in (7.38) and a positron spectrum 12e (1 — €). 

These considerations are important for probing possible extensions or modifications of 
the V — A theory or the standard model. For example, if the b — c transition had been 
V + A, then the e~ spectrum type would have been reversed. 


7.3 PROBLEMS 
7.1 Calculate the neutrino phase space coefficients A and B defined in (7.44) for u decay. 


7.2 (a) Suppose that the muon weak current were V + A rather than V — A, so that the 
interaction for u decay becomes 


G 
H= Ja u (1 +7") Beye (1 -7) Ve 


rather than (7.32). Calculate the electron energy distribution, and show that it corresponds 
to p = 0, where p is the Michel parameter defined in (7.55). Hint: the results for each part 
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can be written down without additional trace calculations. 
(b) Calculate p for the (parity-conserving) vector interaction 


H = V2Gr Pu Ip EYF Ve, 


where the V2 is inserted to obtain the same overall rate. 

(c) Return to the Fermi V — A theory for unpolarized 47 — ev. Give the energy 
distributions for the v, and for the De. In each case the momenta of the other two particles 
are integrated over. Give a simple helicity argument why one of them vanishes at the 
endpoint, € = 1, but not the other. 


7.3 Verify (7.58), (7.60), and the kinematic limits on y for elastic ve scattering. 


7.4 Discuss the difference between the QED corrections to u and p decay in the Fermi 
theory. Hint: Use an effective interaction 


G 
-Lg = Fe Plu (1-7?) n 8" (1-9?) ve 


for @ decay, and write the four-fermi interactions in each case in charge retention form. 


7.5 The formula (7.53) for polarized ~~ decay implies that the angular distribution is 
1 — cos@ for a massless e~ of the maximum allowed energy m,,/2, where 0 is the angle 
between the e~ direction and the muon spin. The analogous formula for z+ decay is 1+cos 0. 
Interpret these results in terms of angular momentum conservation. 


7.6 Use the spin projection in (2.178) on page 37 to verify the cos @ = p, - 8, dependence 
of the expression (7.53) for polarized u decay. Hint: show how the calculation leading to 
(7.49) is modified by the spin projection. 


7.7 Use the formalism developed in Appendix D to verify the 3A/2m, correction in (7.96) 
to the pion beta decay rate. Neglect me. 


7.8 Verify the K3 decay rate in (7.98) in the special case my ~ me ~ 0, fy (t) ~ f4(0), 
and f_(t) ~ 0. 


7.9 Use reflection invariance to show that (n'(p2)| AŻ |n" (p1)), the matrix element of the 
axial current between single pion states, is zero. 


7.10 Calculate (a) the neutron lifetime, (b) the e~ polarization, and (c) the electron 
asymmetry with respect to the neutron spin direction in terms of gy and ga defined in 
(7.121). Work in the approximation that the proton momentum is negligible compared to 
its mass. Hint: write out the amplitude explicitly using the Pauli-Dirac representation for 
the spinors and y matrices. Use the helicity basis for the leptons. 

(d) Repeat the lifetime calculation, this time using trace techniques, summing and averaging 
over all spins, and calculate the ev angular correlation coefficient a-,. The latter is defined 
by the angular distribution 1+ de,8efe-py, where p, can be measured using py = — (Pp + Pe). 


7.11 Calculate the predicted rate and the angular distribution of the 7~ with respect to 
the 7 spin direction for 7~ —> m~ v, in the Fermi theory. Show, using the measured lifetime 
T, = 2.9 x 107} s, that it leads to a branching ratio in agreement with the observed value 
of 11%. Neglect my. 


CHAPTER 8 


The Standard Electroweak 
Theory 


DOI: 10.1201/b22175-8 


In this chapter we describe the standard SU(3) x SU(2) x U(1) model in detail, including 
the structure of the Lagrangian density, the spontaneous symmetry breaking mechanism, 
and the rewriting of the Lagrangian after SSB. The electroweak gauge interactions, the 
properties of the W and Z, the Higgs, and CP violation are described, while neutrino 
mass and mixing is considered in Chapter 9. More detailed treatments may be found in the 
books and reviews listed in the bibliography, while the historical development is described 
in (Weinberg, 2004) and (Quigg, 2015). 


8.1 THE STANDARD MODEL LAGRANGIAN 


The standard model is based on the gauge group G = SU(3) x SU(2) x U(1). The SU(3) 
(QCD) factor has gauge coupling gs and 8 gauge bosons (gluons) G’,i = 1---8. It is non- 
chiral, and acts on the color indices of the L- and R-chiral quarks qra, where a = 1,2,3 
refers to color and r to flavor. QCD was described in detail in Chapter 5. However, the 
bare (current) masses introduced in (5.2) on page 160 are not allowed in the context of 
the full standard model, but must be generated by the Higgs mechanism. QCD itself is not 
spontaneously broken, and the gluons remain massless. 

In contrast to QCD, the electroweak SU(2) x U(1) factor is chiral. The SU(2) group has 
gauge coupling g, gauge bosons W*,i = 1, 2,3, and acts only on flavor indices of the L-chiral 
fermions. It leads to the charged current interactions of the Fermi theory, and also includes 
a neutral boson W° associated with a phase symmetry. The abelian U(1) factor has gauge 
coupling g’ and gauge boson B. It is also chiral, acting on both L and R fermions but with 
different charges. After spontaneous symmetry breaking (SSB), SU(2) x U(1) is broken to 
a single unbroken U(1)g, incorporating QED with the photon a linear combination of W? 
and B. The orthogonal combination (Z), as well as the W~, acquire masses. G is sometimes 
written as SU(3). x SU (2)z x U(1)y. The subscripts have no group-theoretic significance, 
but refer to the physical application, i.e., c refers to color, L to the left-chiral nature of the 
SU(2) coupling, and Y to the weak hypercharge quantum number. 

The standard model Langrangian density is 


L= L gauge + Ly F Lo T Ly uk) (8.1) 


which refer, respectively, to the gauge, fermion, Higgs, and Yukawa sectors of the theory. 
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There are in addition ghost and gauge-fixing terms that enter into the quantization. The 
gauge terms are 
1 i vi 1 i vi 1 v 
Lauge = ~ 7S" = gW W" y Pu B" ; (8.2) 
where the field strength tensors for SU (3), SU (2), and U (1) are, respectively, 


Gi, = 3 G? — 0,G*, — gsfijxGGt, ijk=1--8 
Wi, = 3 W$ = awi = Jeij WIWE, iJ, k=1...3 (8.3) 
Buy = ô B, — ð Bp. 


These include the gauge boson kinetic energy terms as well as the three- and four-point self- 
interactions for the G’ and W’, as shown for an arbitrary non-abelian theory in Figure 4.1. 
The abelian U(1) gauge boson has no self-interactions. We ignore the possibility of FF 
terms such as the Ogcp one in (5.2). 

The fermion part of the standard model involves F = 3 families of quarks and leptons. 
Each family consists of 


0 Up 0 Un 
L — doublets : qm; = ( ite ) ; lmt = ( o ) 
dejz emn JE (8.4) 


. 7 0 0 0 0 
R — singlets : UmR> dm Rs €mR: YmR> 


in which the L-chiral fields are SU(2) doublets and the R fields are singlets, leading to 
parity breaking in SU(2). The superscripts ° refer to the fact that these fields are weak 
eigenstates, i.e., they have definite gauge transformation properties, with the elements of 
each doublet transforming into each other under SU(2), and m = 1,2,3 labels the family. 
After spontaneous symmetry breaking, these will become mixtures of mass eigenstate fields. 
The u? and d? are quarks, which will (after SSB) be identified as having electric charges 
2/3 and —1/3, respectively. There are altogether 2F = 6 quark flavors (u? and d? for each 
family). Each carries a color index u% £ pq OF dir Ra not displayed in (8.4), so there are 
really 3 quark doublets per family. The SU(2) and SU(3) commute, so the QCD interactions 
do not change the flavor, and vice versa. v’ and e? are the leptons. They are color singlets 
and will have electric charges 0 and —1. We have tentatively included SU(2)-singlet right- 
handed neutrinos v9, p in (8.4), because they are required in many models for neutrino mass. 
However, they are not necessary for the consistency of the theory or for some models of 
neutrino mass, and it is not certain whether they exist or are part of the low-energy theory. 
All of these fields except the v?, r carry weak hypercharge Y, which is defined by 


Y =Q-T}, (8.5) 


where TẸ is the third generator of SU (2)z and Q is the electric charge. Weak hypercharge 
should not be confused with the strong hypercharge Y defined in (3.89) on page 104. U(1)y 
commutes with SU(3). and SU(2)z, so it has the same value for all members of SU(3) x 
SU(2) multiplets. The Y eigenvalue is y = q — t} = 4, 2, and i for q? r, u? p, and d? p, 
respectively. For the leptons, y = —4,0, and —1 for 4 z, V}, p, and e? p. 

The representations can be summarized by the symbol {n3, n2, Y}y for fermion w, where 
n3 and nz are the SU(3) and SU(2) representations and y is its hypercharge. Thus, the 16 


1Some authors define Q — T? = Y/2. 
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fields (15 without v°, p) of each family transform as 


1 
pai > qg, =+1/2+1/6=2/3 
qa, = —1/2 + 1/6 = -1/3 
1 
{2-5}, > WV? , = +1/2—1/2=0 
qo, =-1/2-1/2=-1 
2 
{3, 1, oe => Guo a = 2/3 (8.6) 
1 
{3, 1, —ghe > qo,=-1/3 
{1,1,—=1}e = Geo p =-_1 
{1,1,0} | => W 2 = 0, 


where q = t} + y is the electric charge eigenvalue. 

The fermion representation is highly reducible, since each weak eigenstate family trans- 
forms only into itself. The existence of three families is empirical. Lẹ is actually invariant 
under a global U(3)° family symmetry, in which the 3 families of q? z, 0,7, ue R, rs Cnr: 
and v? p transform into each other. However, there is no evidence that these symmetries 
are gauged. In any case most of the generators are broken by the Yukawa interactions. (The 
unbroken vector generators are baryon and lepton number.) 

The SM is anomaly free (Section 4.5) for the assumed fermion content. There are no 
SU(3)° anomalies because the quark assignment is non-chiral, and no SU (2)? anomalies 
because the representations are real (Section 3.1.3). The SU(2)?Y and Y? anomalies can- 
cel between the quarks and leptons in each family, by what appears to be an accident. 
The SU(3)?Y and Y anomalies cancel between the L and R fields, ultimately because the 
hypercharge assignments are made in such a way that U(1)g will be non-chiral. 

The SU (2)z and U(1)y representations are chiral, so no fermion mass terms are allowed.” 
Lyp therefore consists entirely of gauge-covariant kinetic energy terms, 


F 

L= 5 (aari Paar + lmriDPonr ae 
m=1 ( : ) 

+ Up piDur+ dp piDdr rn + enrridenrtimriPrnrr) > 


where we have allowed for an arbitrary number F of fermion families. The first term in (8.7) 
is 


3 
GmitiPamr =i 5 ( an die a” 
a : (8.8) 
ig. w , ig’ igs > UnL 
x | (a.1+ Saw, + £18, | bas + Sean Gul ( d? B $ 
mLB 


where the J is the 2 x 2 SU(2) identity matrix. It is clear from (8.8) that the SU(3). and 
SU(2), x U(1)y groups commute. We will simplify the notation by suppressing the color 


?The possibility of Majorana mass terms for the Vo will be considered in Chapter 9. 
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indices on the quark fields. Then, the fermion gauge covariant derivatives become 


ig, > ig’ 2ig' 
Dude = (a, 7 Du W, + TB.) dmi Dpumr = (a, + “5, Ue 

a p 
Dylmi = (a, | ai Wu 5 B.) lnL» Dudmr = (a, — Te) dog 
Dyer = (Oi = ig’ By) el p D vf R = Oe es (8.9) 


where it is understood that there are also gluon couplings for the q? z, u?, p, and d? p. The 
fermion gauge interactions can be read off from (8.7). 
The Higgs part of £ is 


Le = (D"$)' Dug — V (¢), (8.10) 
ot 
where ¢ = ( ) is a complex Higgs scalar, transforming as {1, 2, shoe Its adjoint 


{1,2*,-F}gt is of = ( Sot |: The gauge covariant derivative is 


. - 
Duo = (a+ 47.%,+£5,) $. (8.11) 


The square of the covariant derivative leads to three- and four-point interactions between 
the gauge and Higgs fields. V (¢) is the Higgs potential. The combination of SU (2) x U(1) 
invariance and renormalizability restricts V to the form 


V$) = +p ph +A)’, (8.12) 


where ¢'¢ = ¢- ot + 6°" ¢°. For u? < 0 there will be spontaneous symmetry breaking, and 
the VEV of (0|¢°|0) will generate the W and Z masses. The \ term describes a quartic 
self-interaction between the Higgs fields. Vacuum stability requires À > 0. 

The last term in (8.1) represents the Yukawa couplings between the Higgs doublet and 
the fermions, which are needed to generate fermion masses by the spontaneous breaking of 
the chiral gauge symmetries. For F fermion families it takes the form 


F 
LYuk =— > [Pind m2 bun +ViinImt?dnr 
m,n=1 (8.13) 
+ Tl iedeoR + Mine R] + hice 
where ot g got 
6=(% ), bzi % | (8.14) 


are, respectively, the Higgs doublet and a conjugate form that will be discussed below. 
TX, T? T°, and T” are completely arbitrary F x F matrices, which ultimately determine the 
fermion masses and mixings. They do not have to be Hermitian, symmetric, diagonal, or 
real (the hermiticity of Lyuk is ensured by the addition of the h.c. to the displayed terms). 


3One can even generalize to non-square matrices of dimension Fz x Fr, where Fr, # Fp are the numbers 
of L- and R-chiral fermions of a given type Y. This would guarantee the existence of one or more massless 2- 
component fermions wz or WR unless there are also Majorana mass terms. Neither option is experimentally 
viable except for the neutrinos. 
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They are the most arbitrary aspect of the SM, introduce most of the free parameters, and 
break most of the U (F)? family symmetries of the rest of £L. The T” term would of course 
be absent if there are no v®,,. For example, the mn term of T? yields 


Pendent Gdnr =Ton [trot dar + dho dR] (8.15) 
and its Hermitian conjugate 
ri ndoro dhar = TEn [do RO UML + dond dr] ; (8.16) 


which lead to the interaction vertices in Figure 8.1. Electric charge is conserved at the 


0 0 0 0 
UmL dmt dar dnr 
4 Fa 4 4 
/ “nd f spå / nde f side 
Paan =", aaa | re. Pea irh, Pa, are 
X ‘ \ t 
i 0 i 0 ' 0 l 0 
dar dnr UmL dmt 


Figure 8.1 Yukawa couplings associated with r4 „g2, od? p + h.c. 


vertices (guaranteed since Q is embedded in G), while chirality is flipped, which is charac- 
teristic of a Yukawa vertex. The family is changed for m 4 n. After SSB, the VEV (0|¢°|0) 
will generate effective mass terms for the d° and other fermions. 

It is straightforward to write the Yukawa couplings involving T%°® in (8.13). However, 


ð- 
transforming as {1, 2, —the so that one can write invariant terms g°,,®u? p + h.c. and 
4 0 p? r t h.c. Neither ¢ nor ¢' have the needed quantum numbers. However, in the SM 
one can avoid the need to introduce a second Higgs doublet ® by utilizing the “tilde trick,” 
namely that @ defined in (8.14) indeed transforms as {1, 2, =a That is because the 2* 


representation of SU(2) is equivalent to the 2, as shown in (3.17) on page 93. Thus, $ can 
play the role of ©, with the identification 6° = ®° and —d- = ®-, 


Fring 0 ulr STan CAKA = dop uhr] . (8.17) 


The equivalence of the fundamental and its conjugate does not generalize to higher unitary 
groups. Furthermore, in supersymmetric extensions of the standard model the supersymme- 
try forbids the use of a single Higgs doublet in both ways in £, and one must add a second 
Higgs doublet. Similar statements apply to many theories with an additional U(1)’ gauge 
factor, i.e., a heavy Z’ boson, or to the SO(10) grand unified theory. One can get by with a 
single (but larger) Higgs multiplet in the extensions of the SM to the non-supersymmetric 
versions of SU (2)z x SU(2)r x U(1) and SU(5). 


the IT” and I” terms require the existence of a complex scalar Higgs doublet 6 = ( Z ) 


8.2 SPONTANEOUS SYMMETRY BREAKING 


The SM as written is not realistic because bare mass terms are not allowed for the elec- 
troweak gauge bosons or for the fermions. However, as described in Sections 3.3 and 4.3 
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effective masses may be generated by spontaneous symmetry breaking, and in fact the Higgs 
doublet @ was introduced for that purpose. If its neutral component ¢° acquires a nonzero 
VEV the SU(2)r x U(1)y electroweak gauge symmetry will be broken to electromagnetism, 
U(1)g, generating masses for the chiral fermions. By the Higgs mechanism the Goldstone 
bosons will be absorbed to become the longitudinal components of the massive WF and Z. 


8.2.1 The Higgs Mechanism 


Similar to the case of a single complex scalar in Section 3.3.3 it is convenient to rewrite ¢ 


in a Hermitian basis as 
OAN _ ( Agi + ide) 
m= ( o° ) 7 ( va (gs tiga) J’ eae) 


where ¢; = i represents four Hermitian fields. In this new basis the Higgs potential becomes 


4 4 2 
Vio = 50 (>: si) +7 (>: s) | (8.19) 


V(@) is clearly O(4) ~ SU(2) x SU(2) invariant. This is an example of an accidental 
symmetry; the most general potential consistent with the SU(2) x U(1) gauge invariance and 
renormalizability exhibits a higher symmetry. The extra (global) generators are explicitly 
broken by the Yukawa and gauge interactions. 

Without loss of generality we can choose the axis in this four-dimensional space so that 
(0|¢;|0) = 0, i = 1,2,4, and (0|¢3|0) = v > 0. (Other directions can be transformed into 
this form by an SU(2) x U(1) rotation.) Thus, 


6 (0) == (0 ) 


1 1 
V(¢) 3 V(v) = a + rae 


“a 


(8.20) 


which must be minimized with respect to v, analogous to the case of a single complex scalar 
in (3.146) on page 118 or a Hermitian scalar in (3.133). Two important cases are illustrated 
by the dashed and solid curves in Figure 3.4. For u? > 0 the minimum occurs at v = 0 
and SU(2) x U(1) is unbroken at the minimum. On the other hand, for u? < 0 the v = 0 
symmetric point is unstable, and the minimum occurs for v 4 0, breaking the SU(2) x U(1) 
symmetry. The point is found by requiring 


V'(v) = v(p? + Av”) = 0, (8.21) 


v= j= (8.22) 


at the minimum. The dividing point u? = 0 cannot be treated classically. It is necessary to 
consider the one-loop corrections to the effective potential, in which case it is found that 
the symmetry is again spontaneously broken (Coleman and Weinberg, 1973). 

We are interested in the case u? < 0, for which the Higgs doublet is replaced, in first 
approximation, by its classical value v in (8.20). The generators corresponding to Lt, L?, 
and L? — Y are spontaneously broken, 


uT al 0 -E fi 0 
=F (9) #0 i5123, Yo=55() ) #0 (8.23) 


which has the solution 
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However, the vacuum carries no electric charge (Qu = (L? + Y)v = 0), so the U(1)g of 
electromagnetism is not broken,* and SU (2)z x U(l)y + U(1)Q. 

We therefore expect the photon A, associated with the unbroken generator Q = L? +Y, 
as well as the eight gluons, to remain massless, while W+ = (Wt + iW?)/\/2 and Z, 
associated with T3 — Y, become massive. To see this, one must quantize around the classical 
vacuum, i.e., write 6 = u+¢’, where ¢’ are quantum fields with zero vacuum expectation 
value. To display the physical particle content it is useful to rewrite the four Hermitian 
components of ¢’ in terms of a new set of variables using the Kibble transformation, as 
discussed for a general gauge theory in Section 4.3. 


j= eee ( ae ) (8.24) 


where the L” are the three broken generators Lt, L?, and L? — Y, and H is a Hermitian 
scalar field, the physical Higgs boson. 

If we had been dealing with a spontaneously broken global symmetry the three Hermi- 
tian fields ¿f would be the massless pseudoscalar Goldstone bosons. These would have no 
potential and would only appear as derivatives. In a gauge theory they disappear from the 
physical spectrum, as we saw in Section 4.3. It is useful to quantize in the unitary gauge, 


pag =e E Th ya) 3> “a (8.25) 


along with the corresponding transformations on the other fields. The unitary gauge is 
simplest for displaying the particle content of the theory, because the Goldstone bosons 
disappear and only the physical degrees of freedom remain. As described in Section 4.4 it is 
better to use other gauges for calculating higher-order contributions to physical processes 
because the unitary gauge is highly singular, and delicate cancellations between diagrams 
are required. 


8.2.2 The Lagrangian in Unitary Gauge after SSB 

Let us rewrite the Lagrangian in (8.1) after spontaneous symmetry breaking in the unitary 
gauge. 

The Gauge and Higgs Sectors 


The Higgs covariant kinetic energy term takes the simple form 


1 ie, Oe |p oe 
TNEA — J iqyyi f g 
(D o) D"ọ = z v) K Wid 5 B, ( H ) +H terms. (8.26) 
We will return to the kinetic energy and gauge interaction terms of the physical H field 
later, but for now we concentrate on the part depending only on v. Equation (8.26) can be 
rewritten using E 

PW? = W? 4 V2rtWts V2 WT, (8.27) 
where 


i wl = iW r Eee nee ge 
Wee tat (8.28) 


“It is automatic that a U(1) remains unbroken in the minimal theory with a single Higgs doublet. 
However, in extended versions of the SM, e.g., with two or more Higgs doublets, all four generators might 
be broken, depending on the scalar potential. See Problems (8.3) and (8.4). 
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to obtain 


v2 


4 


2 
—9 But E 


2,2 
gv +p — 1 2 12 
= wtw, + 5(9 +g“) 


(8.29) 
= 2 wWwtew- Mz H 
= My H + EEA Zu, 


where W= are the complex charged gauge bosons that will mediate the charged current 

interactions, and 

_ —g'B +gW° 
/ g? + g”? 

is a massive Hermitian vector boson that will mediate the new neutral current interaction 

predicted by SU (2) x U(1). In the second form, @w is the weak angle, defined by 


Z 


= — sin w B + cos 0w W°? (8.30) 


t t 
tan dw = I > sn Ow = CA cos Ow = A (8.31) 
g IZ IZ 


where? 
gz = V9 +g’. (8.32) 


The combination of B and W® orthogonal to Z is the photon (y), with field 
A = cos 0w B + sin 0w W”, (8.33) 


which remains massless. The (tree-level) masses are predicted to be 


gv Ag Mw 
My = = Mz = = = —_ Ma =0 8.34 
W 2? Z 2 cos 0 ) A , ( ) 
implying the relation 
: M2 
sin? Oy, — 1 We 8.35 
sin” Ow M2 ( ) 


One can think of the generation of masses as due to the fact that the W and Z interact 
constantly with the condensate of scalar fields and therefore acquire masses, as in Figure 
8.2, in analogy with a photon propagating through a plasma. Each Goldstone boson has 
disappeared from the theory but has re-emerged as the longitudinal mode of a massive 
vector. The number of field degrees of freedom is unchanged. Before SSB, there are 4 
massless electroweak gauge bosons, each with 2 helicities, and 4 Hermitian scalars, ¢;, for 
a total of 12. After SSB, there are 3 massive bosons, each with 3 helicities, the photon with 
2, and one Hermitian scalar H, again totalling 12. One sees from (8.34) that in the limit 
g' > 0 one would have Mw = Mz. That is because the global O(4) symmetry of (8.19) 
is broken to O(3) ~ SU(2) by SSB. This global custodial symmetry is respected by the 
SU (2) gauge interactions in (8.26) for g' = 0, so that Mw+ = Mws = Mz. On a deeper 
level, the custodial SO(3) ensures that the coefficient v? is the same for the WF and Z 
mass terms in (8.29), even for g’ 4 0, implying Mw = Mz cos Oy. Since this relation is 
well satisfied experimentally, alternative models of spontaneous symmetry breaking often 
involve a custodial SU(2) global symmetry to maintain it. 


5There is a confusing variety of notations for g, g’, VEE and gz in the literature (even in the author’s 


own papers). Similarly, there is no uniformity as to whether symbols such as v represent (¢°) or V2(¢?). 


The Standard Electroweak Theory 265 


W eR 


Figure 8.2 Effective masses for the W and electron generated by the VEV v of the 
neutral Higgs field. 


It will be seen below that Gr/V2 ~ g?/8M?,, where Gp = 1.1663787(6) x 1075 GeV? 
is the Fermi constant determined by the muon lifetime. The weak scale v is therefore 


v = 2Mw /g ~ (V2G r) 1? ~ 246 GeV. (8.36) 


Similarly, we will see that g = e/sin w, where e is the electric charge of the positron. 
Hence, to lowest order 


(ra/V2Gr)"? 


Mw = Mz cos 0w ~ ‘ 
sin Ow 


; (8.37) 
where a ~ 1/137.036 is the fine structure constant. We will see that sin? 0w can be mea- 
sured from the predicted neutral current scattering to have a value ~ 0.23, so one expects 
Mw ~ 78 GeV, and Mz ~ 89 GeV. (These predictions are increased by ~ 2 GeV by loop 
corrections, including the running of a.) The W and Z were discovered at CERN by the 
UA1 (Arnison et al., 1986) and UA2 (Ansari et al., 1987) collaborations in 1983, in the 
reactions pp > W + X > w + X and pp —> Z + X — €+é- + X, where X represents unob- 
served hadrons. Subsequent measurements of their masses and other properties have been 
in excellent agreement with the standard model expectations, including the higher-order 
corrections (Patrignani, 2016). The current values are 


Mw = 80.385 + 0.015 GeV, Mz = 91.1876 + 0.0021 GeV. (8.38) 


The full Higgs part of £ is 


Le =(D"6)'D, — V ($) 
2 2 (8.39) 
=M} w"+tw7 (1 + z) + 5M2Z*Z, (1 + =) + z (3 H) — V(¢). 


The second line in (8.39) includes the mass terms, and also describes the ZZH?, W+W- H? 
and the induced ZZH and WtW~H interactions, as shown in Table 8.1 and Figure 8.3. It 
also contains the canonical kinetic energy term and potential for the H. Similar to (3.140) 
on page 115, the Higgs potential (8.12) in unitary gauge becomes 
pî À 
V(¢) > ar KH? + vH? + 7H. (8.40) 


The first term is a constant, (0|V(v)|0) = —y*/4). It reflects the fact that V was defined 
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so that V(0) = 0, and therefore V < 0 at the minimum. Such a constant term is irrelevant 
to physics in the absence of gravity, but will be seen in Section 10.1 to be one of the most 
serious problems of the SM when gravity is incorporated because it acts like a cosmological 
constant much larger (and of opposite sign) than is allowed by observations. 


TABLE 8.1 Feynman rules (~ i£) for the gauge and Higgs interactions after SSB, taking 
combinatoric factors into account.” 


WIW, H: igu g?V = 219 uv Miy WtW; H°: siGuvg? = diia ME 
Zu ZH: Siu GeV = igu Mz ZZ H?: SiGuv 9% = 2 guy Me 
H?: —6idy = —31 Ma HH": —6id = -3i 

Hf f: —ihş = —i™t 


Wi(p)w(QWwe(r): — teCuve(p, 4,7) 


Wr (p)Z.(q)We (r): ig cos OwCyvo(P, 9,7) 
WIW W; W7: i9’ Quvpo 

Wi Zve W; : —teg cos Ow Qupve 
WiZZoW; : —ig? cos” Ow Qupve 
WwW; : —ie? Qn pve 


Cath: q, r) = MACI P)o H Ipo (P Fle T ual? a Dis 
Quvpo = 29uv9po ~~ GupJvo a JuoJvp 


°The momenta and quantum numbers flow into the vertex. Note the dependence on M/v or M?/v. 


The second term in V represents a (tree-level) mass 


My = V—2y? = V2Av (8.41) 


for the Higgs boson. The weak scale is given in (8.36), but the quartic Higgs coupling A is 
unknown. A priori, A could be anywhere in the range 0 < A < ~, so there is no theoretical 
prediction for My. The Higgs was discovered at the LHC in 2012, with mass My ~ 125 
GeV, completing the experimental verification of the standard model. The Higgs discovery, 
properties, and their implications, as well as the theoretical upper (from unitarity and 
perturbativity) and lower (from vacuum stability) limits on My, and indirect constraints 
from precision experiments, will be described in Section 8.5. 

The last two terms in V are, respectively, the induced cubic and the quartic self- 
interactions of the Higgs. 

The SU(2) gauge kinetic energy terms in (8.2) lead to 3- and 4-point gauge self- 
interactions for the W’s, 


Lys =—ig(0,Wy)WiWe [g g7 — 9°? 9 
— ig(ðp WT) WEW [9°7 g” — g” gt] (8.42) 
— ig(O,.W>)WW;t [gP gh? — g g”°], 


po re] 


and 
2 
g nl — Veo — VO 
Lwa= 7 [WEW W; W; over — wii wiwew, over), (8.43) 


where Q,, 0 is defined in Table 8.1. These carry over to the W, Z, and y self-interactions 
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Figure 8.3 Higgs interaction vertices in the standard model. 


provided we replace W? by cos 0w Z + sin 0w A using (8.30) and (8.33) (the B has no self- 
interactions). The resulting vertices follow from the matrix element of i£ after including 
identical particle factors and using g = e/ sin ðw. They are listed in Table 8.1 and shown 
in Figure 8.4. (They also follow from Problem 4.8.) 


The Yukawa Sector 


The fermions acquire masses by the SSB, as illustrated in Figure 8.2. Inserting the unitary 
gauge expressions for ¢ and ¢ in (8.25) into (8.13) yields (for F families) 


F 
H 
—LYuk = 5 mL an ea u? p + (d,e,v) terms + h.c. 


V2 


= uf (M“ + h“H) u% + (d,e,v) terms + h.c., 


(8.44) 


T. : 
where in the second form u? = (u?,u$,---uh,) is an F-component column vector, with 
a similar definition for uù. M“ is an F x F fermion mass matrix, 

uU 
Minn 


=T (8.45) 


u Y 
mn V2 , 
induced by spontaneous symmetry breaking, and 

T“ M* gM" 


ht = = (8.46) 


is the Yukawa coupling matrix. We have already emphasized that I“, and therefore M” 
and h”, need not be diagonal, Hermitian, or symmetric. To identify the physical particle 
content it is necessary to diagonalize M by separate unitary transformations Az, and Apr 
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Figure8.4 The three- and four-point self-interactions of gauge bosons in the standard 
electroweak model. The momenta and charges flow into the vertices. 
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on the left- and right-handed fermion fields. Then (for F = 3), 


Mm, 0 0 
AŻ M"AS =M =| 0 m 0 (8.47) 
0 0 Mt 


is a diagonal matrix with real non-negative eigenvalues equal to the physical masses of the 
charge 2 quarks. Similarly, we denote the eigenvalues of the Yukawa matrix h” by hu, he, 
and h. One also diagonalizes the down quark, charged lepton, and neutrino mass matrices 
by 
d ef yfe ge fe v V AV JY 
AŤ M*A% = M$, AŠ MAS = M$, ANTM AY = MY. (8.48) 


In terms of these unitary matrices we can define mass eigenstate fields uzg = Atty? - 
(up cr tr)", with analogous definitions for ug = Aut, dL R = AT RE? p, ELR = 
AT ne? ps and vL, R = A i RY? po so that, e.g., 


u? M“u?, = ür Mpun. (8.49) 


For the quarks, the mass eigenvalues in Mee are the masses that appeared as bare 
or current masses in the QCD Lagrangian density (5.2) on page 160, even though in the 
context of the full SM they are really spontaneously generated. They are not to be confused 
with the dynamical masses of O(300 MeV) generated by the spontaneous breaking of the 
global SU (3)z x SU(3)pr chiral symmetry of QCD, as described in Section 5.8. 

So far we have only allowed for ordinary Dirac mass terms of the form Z? zv? p for the 
neutrinos, which can be generated by the ordinary Higgs mechanism. Another possibility are 
lepton number violating Majorana masses, which (for v?) require an extended Higgs sector 
or higher-dimensional operators. It is not clear yet whether Nature utilizes Dirac masses, 
Majorana masses, or both. These issues will be discussed in Chapter 9. What is known is 
that the neutrino mass eigenvalues are tiny compared to the other masses, £ O(0.1 eV), 
and most experiments are insensitive to them. In describing such processes, one can ignore 
T”, and the vp effectively decouple. Since MY ~ 0 the three mass eigenstates are effectively 
degenerate with eigenvalues 0, and the eigenstates are arbitrary. That is, there is nothing to 
distinguish them except their weak interactions,’ so we can simply define ve, Vu, Vr as the 
weak interaction partners of the e, u, and 7, which is equivalent to choosing A¥ = A§ so 
that vz, = Ast v? . Of course, this is not appropriate for physical processes, such as oscillation 
experiments, that are sensitive to the masses or mass differences. 

The unitary matrices Az R can be constructed by noting that MM t and MM are 
Hermitian. From (8.47) and its conjugate one has 


m? 0 0 0 
‘4 x Z 5 0 m, 0 0 
A“ MYM“ Ae = Â M Me As = Me = 2 . 8.50 
L L R R D 0 0 ma 9 
0 0 0 


Ay r and the eigenvalues can be constructed by elementary techniques. The Hermiticity of 
MM and M'M guarantees that the ne are real and that the eigenvectors are orthogonal, 


u 


©The special case of a Hermitian (symmetric) M can be diagonalized by Az = Ar (Ar = A). However, 
additional phases in Ar, analogous to (8.51), may be required to ensure positive (and real) eigenvalues. 

T As discussed in Section 8.6.6, this implies that the lepton flavors Le, L, and L>; are separately conserved 
for my = 0. 
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while their special formë implies that m2 > 0. However, the Ay p are not unique, as is 
indicated by the hats. We have implicitly assumed in writing (8.50) that the eigenstates are 
ordered in the same way for MM? and MŻtM. Even then, Af p are only determined up to 


phases. That is, if AY p are solutions to (8.50), then so are 


L=AtKi, Ap= ARK, (8.51) 
where po 
el PiL,R 0 0 0 
0 = Prr 0 0 
Kir = 0 0 eits R O0 (8.52) 
0 0 0 


are arbitrary diagonal phase matrices that correspond to the unobservable phases of the 
mass eigenstate urz pr fields. (If there are degenerate eigenvalues, then there is additional 
freedom in Kj! g associated with arbitrary unitary transformations in the space of degen- 


erate eigenvectors.) For a particular choice of transformations, A‘iM uju will be diagonal 
(provided that the eigenvectors are ordered the same way), but there is no guarantee that 
the elements will be real or positive. The usual prescription is to choose K} for convenience, 
e.g., to remove unobservable phases from the CKM matrix. Then one can choose the phases 
in KẸ so that the my are real and non-negative. The A% are not observable in the SM, 
i.e., they don’t enter the Lagrangian, though in principle they could be observable in some 
extensions of the SM involving new interactions of the R fields. 

Finally, the fermion terms in £ must be rewritten in terms of the mass eigenstate fields. 
For the u quarks, the kinetic energy and Yukawa terms are 


H 
Ly = i Pul + Thi Puh, — jagaa (: + =) + re] 
H 
= ūLiA¥ At Jur + üri AH A% Jur — fos agar UR (1 + =) + he] 
V 


H 
= typi ur + üri ður — ae (1 + ) + he] 
V 


-Ee [am (of) es 


where ur = Urr + urr. The kinetic energy terms remain in their canonical form since Af. R 
are unitary, and the mass terms are of canonical form since Mj is diagonal. The coupling of 
the physical Higgs boson H to ur is —ihy,. = —imy,,/V, i.e., for a given v the Higgs coupling 
is proportional to mass, just as for the gauge and self-couplings in Table 8.1. 

All of these considerations apply to the d quarks and leptons as well, so 


Lo = ar lia —m, (+2) ils (8.54) 


where the sum runs over all of the fermions ~ = u,d,e, and v (with the usual v caveats). 
The coupling of H to Yr is —im,/v = —igm,/2My, which is very small except for the 
top quark. The decay H — bb dominates since My « 2My. The branching ratios for 


8That is, m2, = (M“ax,|M“z,r) > 0, where £r is the normalized eigenvector of MÌ M”. 
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H — Gc and t+r~ are smaller and the other ff rates are negligible, making the Higgs 
difficult to produce and difficult to observe. The Higgs Yukawa couplings are scalar and 
are flavor-diagonal in the minimal model: there is just one Yukawa matrix for each type of 
fermion, so the mass and Yukawa matrices are diagonalized by the same transformations. 
In generalizations in which more than one Higgs doublet couples to each type of fermion 
there will in general be flavor-changing Yukawa interactions involving the physical neutral 
Higgs fields (Glashow and Weinberg, 1977). There are stringent limits on such couplings; 
for example, the Kz — Ks mass difference implies hy,/My < 1076 Gevt, where has is the 
ds Yukawa coupling (Gaillard and Lee, 1974b; Langacker, 1991; Nir, 2015). One must also 
be careful that U(1)g is not broken in such extensions.’ 


The Weak Charged Current (WCC) 


We now rewrite the fermion gauge interactions in terms of the mass eigenstate fields. These 
can be read off from (8.7), using the expressions for the gauge covariant derivatives in (8.9); 
the expressions in (8.27), (8.30), and (8.33) for the gauge boson mass eigenstates; and the 
unitary transformations defined following (8.48), to obtain the fermion part of £, 


g P 
Lyt Lyu = Lo ~ 5% (JeW + Iw) 
99 uy _VPTIP? yy Z, (8.55) 

/g2 F g2 Qt 2 Zp 


= Ly + Lw + LQ + £z, 


where Ly is given in (8.54) and the other terms are the fermion gauge interactions. The 
W= terms are the charged current interaction, with charge raising and lowering currents 


F 
Jt} = X [ERa — ve), Han — 7°) a?,] 
m=1 
(8.56) 
Thy = So [Ea — 78, + By (1 — 9? )uP,] 
m=i 


in the weak eigenstate basis. We saw in Chapter 7 that these violate P and C maximally, 
but are CP conserving. The fermion gauge vertices are shown in Figure 8.5. 

The amplitude for a t-channel four-fermion interaction in the SM is second order, as 
shown in Figure 8.6. It is 


(iLw)? -ig\? p» Uy —i 
ae a OW det gua M2, Foe a. (8.57) 


For small momentum transfer, |q“| << My, this leads to the effective Hamiltonian density 


Gr 


v2 


©The minimal supersymmetric extension of the SM does not involve flavor-changing neutral Higgs cou- 
plings to fermions or electric charge violation due to the Higgs fields even though there are two Higgs 
doublets. There are, however, potential problems associated with possible charge and color violating VEVs 
of scalar quark fields in some regions of parameter space. 


Hess = iM = ZE eda (8.58) 
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Figure 8.5 The fermion gauge interaction vertices in the standard electroweak model. 
gt = tr — 2sin? war and gh = tr, where #2, = t; = +4 and t; = Ë; ===. 


k 2 
The dju;W7 vertex is the same as for ujdj;Wt except Vai; > (Va Jes = Vjij;- The 


lepton-W~ vertices are obtained by u; > vi, dj —> ej, and V, > Ve. 


which reproduces the effective Fermi theory provided we identify the Fermi constant as 


GF g? 1 
Ji = BME, = OF (8.59) 


(Of course, the correspondence carries over to decay processes and s and u channel ex- 
changes.) (8.59) allows us to determine the weak scale v ~ 246 GeV, as in (8.36). The 
phenomenology of the weak charged current interactions was discussed in Chapter 7. 
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Figure 8.6 Charged and neutral current four-fermion t-channel exchange processes 
in the standard model. The W~ or Z carries four-momentum q. 


Rewriting the currents in terms of mass eigenstates 


Jt} = Wry" At AL er + Wry" AT Aldi, — Sh, = 2Wry"Vfv_ + 2dry"Viuz, (8.60) 
=~ —_—— 
Ve Va 


where uz, dL,eL, and vz are the F-component column vectors defined following (8.48). The 
F x F unitary quark mixing matrix 


V, = Av At (8.61) 


describes the mismatch between the unitary transformations relating the weak and mass 
eigenstates for the up and down-type quarks. It is ultimately due to the mismatch between 
the gauge and Yukawa interactions. Ve is the analogous leptonic mixing matrix. It is critical 
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for describing neutrino oscillations and other processes sensitive to neutrino masses, and 
will be described in Chapter 9. As commented above (8.50), however, for processes for which 
the neutrino masses are negligible we can effectively set Vy = I (more precisely, Vp will only 
enter such processes in the combination Vi Ve = I, so it can be ignored). 

An arbitrary complex F x F matrix involves 2F? real parameters. However, V; is uni- 
tary, which imposes F? constraints, (Vi V})mn = mn, so it can be described by F? real 
parameters. Not all of these are observable, however. Recall that the F fields uzr each have 
an arbitrary unobservable phase, as do the dz fields, described by the matrices KET in 
(8.52). One can use the freedom in choosing KET to remove 2F — 1 phase differences from 
V,, so that altogether there are 


F(F-1). (F-1\(F-2 
2 j 2 
m — ——O— 


angles phases 


F? — (QF — 1) = (F —- 1}? = (8.62) 


observable parameters. These consist of F (F — 1)/2 rotation angles (the number of angles in 
an O(F) rotation), with the remainder being observable C P-violating phases. To see that 
such phases are C’P-violating, one has that under CP 


WH & -WF 


7 - n n (8.63) 

UmL Yu Vamndnt > —dnL Y" VamnUmb = -dnr Y" VonmUmL, 
where it is understood that (t, Z) — (t,—7) as well. (The transformation of W+" in (8.63) 
was defined so that the Lagrangian would be invariant in the absence of phases.) The 
hadronic part of the charged current interaction therefore transforms into 


L(t, Z) = E [dry V]ur W; + ür” Vyd W} | 
(8.64) 

g — * 7 * — 
= TA [ür V dW? +d” V} uLW,, ] 3 


which differs from L} (t, —7 ) if there are observable phases!? in V}. One must have F > 3 
families to obtain C'P breaking in the weak charged current interactions, and all three must 
be involved in a given process to lead to an observable effect. These issues will be described 
in detail in Section 8.6. 

For F = 2 families, there is one angle and no phase, so V, is just the Cabibbo rotation 


Veabibbo = ( es n ) : (8.65) 
where sin@, ~ 0.23, as in (7.22) on page 230. Thus, in the four quark theory, the ampli- 
tudes for lz > vez, (dr > up or sr > cz), and (dy > cy or sr, —> uz) are proportional to 
1, cos ĝe, and sin ĝe, respectively. The Cabibbo rotation in (8.65) is an excellent approxima- 
tion to transitions amongst the first two families, even in the full three family case, because 
the elements of V} connecting the first two families to the third are very small. 

For F = 3 one has Vy = Voxm, which involves three mixing angles and one observable 


C'P-violating phase, 


Vua Vus Vub 1 À A3 
Vorm = | Vea Ves Vo |~| A 1 » |, (8.66) 
Via Vis Veo A A? 1 


10 There are no such phases in the weak basis, so the charged current vertices are C P-conserving while 
the quark mass matrix (and therefore the propagators) are C P-violating. The situation is reversed in the 
mass eigenstate basis, but the two descriptions are equivalent for physical amplitudes. 
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where A = sin 0e. The second form is an easy to remember approximation to the observed 
magnitude of each element, which displays a suggestive but not well understood hierarchical 
structure. These are order of magnitude only; each element may be multiplied by a phase 
and a coefficient of O(1). There have been extensive studies of K, D, and B decays as well 
as other decays and scattering processes to determine the elements of Vex m, to test its 
unitarity (which could appear to be violated in the presence of new physics), and to test 
whether the observed CP violation can be described by the phase in Vogm. These tests 
and parametrizations of Vox will be described in Section 8.6. Here we just note that 
Vcgm can most likely account for the CP violation observed in particle processes, but an 
additional source of CP breaking is required to account for baryogenesis, i.e., the origin of 
the baryon (matter-antimatter) asymmetry of the universe. 


QED 


The second gauge interaction term in (8.55) is the QED interaction 


gg p 
VP? tg’? La 


where A,, in (8.33) is the massless photon field and the (Hermitian) electromagnetic current 


La=- (8.67) 


F 
- 2 
=X gp =Y j Tm Um g nt da = eni em 
3 
r m=1 (8.68) 
2 
= = 30 yeu? -iay ad? — Eye? 


is the sum over all fermion vector currents weighted by their charges qr. Jo is purely vector 
since we chose the weak hypercharge assignments y = q — tł for both the left and right 
chiral fields to yield the same q even though their t}, differ. Comparing with (2.218) on 
page 43, we identify the positron electric charge with 


j 
e= yap titw => amat (8.69) 
Jô takes the same form in the mass eigenstate basis, 
F 
J=» Fama = sand — Em" em| = Sanu said eye, (8.70) 
m=1 
since, €.g., 
Tylu = ao ul + ayuh = ary At Atur + aR AÑ Ahur = aylu. (8.71) 


This is ene due to the fact that only fields of the same charge and chirality, such 
as the u?,,, are able to mix with each other. Jo is therefore flavor diagonal and family 
universal, as well as P, C, and CP preserving. The standard model incorporates QED and 
all of its successes, as described in Section 2.12.3. 


The Weak Neutral Current (WNC) 


The Fermi theory of charged current weak interactions and QED were well established before 
the development of the standard model and were incorporated into it. However, the weak 
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neutral current (WNC) interaction (along with the W and Z bosons) was a new ingredient 
predicted by the SU(2) x U(1) unification. 
The weak neutral current interaction is described by the last term in (8.55), 


g g 
sc a a oa Te ae a (8.72) 


where Z,, is the massive neutral boson defined in (8.30). The (Hermitian) weak neutral 
current is 


= ass [#2,,(1 — 7°) — 2qr sin? Ow | oP 


(8.73) 
= “re Wey" (1 — 7° )b! — sin? Ow JE, 
summed over all fermions. Specializing to the SM quantum numbers, 
JE = ub yhuld — Pod} + oP ytu? — ety Met — 2sin® Ow JG (8.74) 
= tipytuz — dpyd, + opy"vzp — Exy“ez — 2sin? Ow JB, f 
where the t? = +4 have been absorbed into the Pz, e.g., 40y” (1 — 7°)u = ūry”uz. The 


neutral current has two contributions. The first only involves the left-chiral fields and is 
purely V — A. The second is proportional to the electromagnetic current with coefficient 
sin? ðw and is purely vector. P and C are therefore violated in the neutral current inter- 
action, though not maximally. There are no phases, so CP is conserved. J is sometimes 
written in terms of its chiral or its V, A components as 


Je =2Ņ dry! [ex(r) Pr + €n(r)Prl br = X bee" [gy - 91477] br (8.75) 


where gt, 4 = €x(r)+€r(r). This form is especially convenient for generalizing to alternative 


or extended gauge theories, and also for incorporating radiative corrections. In the SM 
ex(r) = th, — sin? Ow qr, er(r) = — sin? Ow qr sa 

r —¿ — 2sin? 6 r 43 (8. ) 

Iv = bry sin” Ow qr, GA >= "rh 


Like the electromagnetic current J% is flavor-diagonal and has the same form in the weak 
and mass bases in the standard model; all fermions that have the same electric charge and 
chirality and therefore can mix with each other have the same SU(2) x U(1) assignments, 
so the form is not affected by the unitary transformations that relate the bases. It was for 
this reason that the GIM mechanism (Glashow et al., 1970) was introduced into the model, 
along with its prediction of the charm quark. Without it the d and s quarks would not 
have had the same SU(2) x U(1) assignments, and flavor-changing neutral currents would 
have resulted. To see this, suppose there were only one left-chiral quark doublet, (u? d?.)", 
with s? and the right-handed quarks being SU(2), singlets. Then, in terms of the mass 


eigenstates uz, = u} and 
dp \ _ (cos@. —sin 4 dt 
( Sr ) _ a cos ĝe ) ( a y? (8.77) 


the hadronic part of Jf would be 
JÈ” =a ytu? — do. yd? — 2 sin? Ow ahs 
=tpyuz — dpy“dz cos? be — 5Y“ sr sin? be (8.78) 
= (dry"sz + 5,y"dz) cos ĝe sin 6. — 2 sin? 0w re 
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leading to unacceptable strangeness changing neutral current transitions (as well as large 
box diagram effects), as shown in Figure 7.3. The absence of such effects is also a restriction 
on extensions of the standard model involving exotic fermions (Langacker and London, 
1988b). 

A typical four-fermion process mediated by the Z in the t channel is shown in Figure 8.6. 
In the limit that the momentum transfer is small compared to Mz one can neglect the q- 
dependent terms in the propagator, and the interaction reduces to an effective four-fermi 
interaction!! 


eff = 5 gIZ (8.79) 
The coefficient is the same as in the charged current case because 


Gr 7 g? B g? +9? 
V2 8M% 8M3 


(8.80) 


That is, the difference in Z couplings compensates the difference in masses in the propagator. 
However, unlike the charged current, J? is Hermitian, so there is an extra combinatoric 
factor of 2 when one takes a matrix element of HNG- 

The weak neutral current was discovered at CERN in 1973 by the Gargamelle bubble 
chamber collaboration (Hasert et al., 1973) and by HPW at Fermilab (Benvenuti et al., 1974) 
shortly thereafter, and since that time Z exchange and y — Z interference processes have 
been extensively studied in many interactions, including ve > ve, vN > vN, vN > vX; 
polarized e~-hadron and p-hadron scattering; atomic parity violation; and in ete~ and 
Z-pole reactions. Along with the properties of the W and Z they have been the primary 
quantitative test of the unification part of the standard electroweak model. 


8.2.3 Effective Theories 


Let us digress briefly on effective theories, of which the four-fermion WCC and WNC in- 
teractions derived from the SM are an example. An effective field theory is a description of 
physics at a given energy scale in terms of the degrees of freedom that can actually appear 
as physical states at that energy. In particular, it is often convenient to integrate out the 
fields corresponding to particles too heavy to produce, so that they do not appear explic- 
itly in the theory but whose effects are described by non-renormalizable operators.!? This 
can be done easily in the path integral formalism, but the results can also be obtained by 
examining Feynman diagrams (as we have done for WCC and WNC interactions in (8.58) 
and (8.79)), or by solving the Euler-Lagrange equation for the heavy field, neglecting the 
kinetic terms. To illustrate the latter method, let us rederive the fermion WNC interaction 
in (8.79). If we ignore the Higgs and self-interaction terms, the terms in £ involving the Z 
are 


JE. (8.81) 


11Care must be taken with factors of 2 from t?,, PL,r, the coefficient in (8.72), etc., in deriving (8.79). 
Also, an additional factor of 2 will arise in the off-diagonal terms in the square of Jz or in taking matrix 
elements of the diagonal terms. Finally, some authors absorb the 4 in (8.72) into the definition of Je, but 
fortunately the 2 is explicitly removed from the r.h.s. of (8.75) so that the ez,r(r) and gj, 4 are the same. 

12 According to the decoupling theorem (Appelquist and Carazzone, 1975) heavy particles do not enter 
the effective low energy theory except through renormalized parameters and non-renormalizable operators. 
However, care must be taken in the application of this result when the heavy particles violate a symmetry and 
therefore require strong coupling, as we will see in Section 8.3.6 in connection with the oblique parameters. 
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The Euler-Lagrange equation for the Z is therefore 


ôL ôL g 
E + MZ) Za — 0,0) Z” = ———— Jay. 8.82 

2 (a) 5Z,, os 2) Zu — Ôu 2.cos Oy 7" eee) 
At low energies compared to Mz the derivatives will ultimately be replaced by factors of 
E, where E is a typical external energy for the process being considered. Therefore, (8.82) 


can be solved, yielding 
GI Z E? 
= —>—"__ +0 | —, }. 8.83 

£ 2M3 cos Ow i & een 


This can be reinserted into £z, to yield the effective four-fermion operator 


g? 


-— JE Jz 8.84 
8M2 cos? Ow ap ee 


LNG = -HN = Ia = 

More generally, in an effective theory (e.g., Weinberg, 1995; Pich, 1998; Burgess, 2007; 
Willenbrock and Zhang, 2014; Ellis et al., 2015) one writes the most general Lagrangian 
density Leff for the low energy fields that is consistent with the symmetries, including 
higher-dimensional (non-renormalizable) terms such as those in (8.84). Powers of derivatives 
0,,/M, where M is the heavy particle scale, can also be included, so Leff is a systematic 
expansion in powers of energy. The coefficients of the terms in Leff can be obtained from 
experiment, or can be computed in terms of the underlying theory (as in (8.84)). One 
can treat the low energy effective theory like any other if one uses a mass-independent 
renormalization scheme (such as MS), with a finite number of counterterms for any given 
power of energy. Major applications of effective theories are to describe the low energy limit 
of a known theory, focusing on the important aspects and symmetries (as in the example 
above), or to parametrize the observable effects of still unknown new physics associated with 
a higher scale M (such as the use of the four-fermi interaction to describe the WCC before 
the SM was developed). Another major example is chiral perturbation theory, touched on in 
Section 5.8, in which the low energy strong interactions are expressed in terms of an effective 
theory of mesons and baryons incorporating the spontaneously broken SU (3) x SU(3) flavor 
symmetry, the soft pion theorems, etc. The heavy quark and soft collinear effective theories 
were mentioned in Section 5.6. 


8.2.4 The Re Gauges 


It is straightforward to write the interaction vertices and propagators in an arbitrary Re 
gauge, as was discussed for a general non-abelian theory in Section 4.4. This is useful both 
for higher-order calculations and for applications of the equivalence theorem (Section 8.5). 
Following SSB, instead of the Kibble representation in (8.24) the Higgs doublet can be 
rewritten in terms of shifted fields 


+ 
6=( vB fie i; (8.85) 
V2 


just as in the SU(2) example in Section 4.4. H is the physical Higgs scalar, while z, wt, 
and wT = w*! are the Goldstone boson fields that disappear in the unitary gauge. We also 
define the ghost fields 


nz = cos Ow ns — sin 0w ny, ny = sin Ow n3 + cos Ow ny 


Fi 8.86 
n= m T m2 ae nc = cos 20w nz + sin 20w ny, 


V2 
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where 7; and ny are associated with SU(2); and U(1)y, respectively. The anti-ghost fields 
are n}, n}, and 


nf 
n =n 
ntt = =a (8.87) 


The propagators for the mass eigenstate fields are listed in Table 8.2. 


TABLE 8.2 Propagators for the standard model mass eigenstate fields in the Re gauges.® 


te (k) : z [o sergo] 

iDWa(k): gsi o — E] 

DER): mig [a - MEGS 

iAn(k) : pour iA(k): izn 
iAws(k) : MZA iDan (k): g 
iDys(k): HER iDno(k): WZ 

iS pzr(k) : omy = ign iS, (k) : prem ifa Pr 


a¢ = 0, 1, and co correspond, respectively, to the unitary, ’t Hooft-Feynman, and Landau (renormalizable) 
gauges. A +ie is implicit in each denominator. Neutrino masses are neglected. (The propagators for massive 
Dirac and Majorana neutrinos are considered in Section 9.1.4.) 


The Higgs potential in (8.12) takes the form in (4.81), which is repeated here for conve- 
nience 
V ($) = —p? H? + vH [H? + 2? +2wtw ] + 4 [H? +2 + 2wtw7]’. (8.88) 
The gauge interactions of the Higgs fields are 
Ly=- iZ (w Bwt) Cu + “2 (oh) Zu 


ee (iH — iz] rwt) Wo i2 (woh + ial) wi 


g’ g gg 
+vH (Sw, + % 73) +o (wtW-* +w Wt) B, 
2 2 2 2 2 
+, — (9 tuy- Jz 2 BOR (9° tuy- IZ y2 
putu (Eww; + Bt) 22 (Lwow; 22) 
1 
ka (wt (H —iz)W “+w (H + iz)w+") Bu, (8.89) 
where 5 ; 
W3 +B, 
C, = FHF” — 0095 26y Z, + sin yw Ay. (8.90) 


Equation (8.89) is a generalization of the SU(2) example in (4.84) and of the unitary gauge 
expression in (8.39). (The gauge boson mass term is the same as (8.26).) 
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One can similarly extend the discussion of the Higgs Yukawa couplings in Section 8.2.2 
to include the Goldstone bosons. Analogous to (8.53) the quark mass and Yukawa terms 
are 


H - H 
-Lı = üMp [i+ Z-as] u+dM$ již +n] d 
V V V V 


8.91 

V2 di_vtmes cet an De 7 d $ i 
T [-V} M Pr + MPV} Pr] uw + PE [-MBVPL + VaMpPr] dwt, 

where u and d are column vectors of mass eigenstate fields, Mee are the diagonal matrices 

of mass eigenvalues, and V} = At A? is the quark mixing matrix. A similar result applies to 

the leptons, with u —> v, d —> e, and V} — Vz, provided the neutrino masses can be ignored 

or they are Dirac. 

The ghost interactions with the scalars and gauge fields are 


Lghost = — ig| (0%) n7 ("9") nt] Wii 
— ig| (an**) W$ — (a) Wg | ns — ig (nh) [nt We -w| 


gMw -4 o\ 4 gzMz gMw ik 
+ [= ( tint +n tn) + nina] H + E (nt -n În) iz 


2g 2€ DE 
+ oe [(n**ne = nha) wr + (ninc -= aba) w7]. (8.92) 


8.3 THE Z, THE W, AND THE WEAK NEUTRAL CURRENT 


After the discovery of the weak neutral current in 1973 there were generations of weak 
neutral current experiments, eventually at the precision of a few % and in a few cases 
~ 0.5%. The motivation was in part to determine whether its properties agreed with the 
predictions of the SU(2) x U(1) model, and whether the latter could be distinguished 
experimentally from a number of alternative theories. This goal was largely achieved by 
the model-independent analyses of the data, which allowed arbitrary V and A interactions 
(but generally assumed family universality and V — A couplings for neutrinos). These were 
consistent with the SM but eliminated competing gauge theories predicting four-fermi in- 
teractions very different from the SM predictions. Other possibilities involving purely S, P, 
and T interactions were excluded by the observation of weak-electromagnetic interference in 
processes involving charged particles. More complex gauge theory competitors with similar 
four-fermi interactions but different gauge bosons were largely ruled out by the later dis- 
covery of the Z and W. As later generations of more precise results (including the Z pole, 
Tevatron, LHC, and other high energy experiments) became available, the emphasis turned 
more towards precision tests of the SM at the loop level, leading to predictions for the top 
quark and eventually the Higgs masses, precise measurements of couplings for comparison 
with unification predictions, and testing the underlying structure of renormalizable gauge 
theories. They also allowed searches for and constraints on small deviations from the SM 
predictions that could be attributed to new physics beyond the standard model (BSM). 

In this section we describe some of these tests and their implications. More detailed 
discussions may be found in, e.g., (Kim et al., 1981; Amaldi et al., 1987; Costa et al., 
1988; Langacker et al., 1992; Langacker, 1995; Erler and Su, 2013; Patrignani, 2016). For a 
historical perspective, see (Langacker, 1993). 
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8.3.1 Purely Weak Processes 
ve — ve” Elastic Scattering 


The neutral current reactions v,e~ — v,e~ and De~ — De~ do not occur in the Fermi 
theory, but are predicted to proceed by t-channel Z exchange, as shown in Figure 8.7. They 
were observed and studied in a number of experiments at CERN, Fermilab, and Brookhaven, 
most precisely by the CHARM II collaboration at CERN (for a review, see J. Panman in 
Langacker, 1995). The momentum transfers are tiny compared to the Z mass, so the four- 
fermion effective interaction in (8.79) applies. From (8.74) the effective Lagrangian density 
is 


ve Gr = = ve ve 
sene a EES P Wu Eul oa ye (8.93) 
In the SM one expects, 
ve 1 nD ve 1 
gy ~-z +2 sin Ow, Ja ~~ 5: (8.94) 


gy is predicted to be small, since sin? Ow ~ 0.23. Equation (8.93) is valid in any gauge the- 
ory, provided that any right-chiral neutrino vz does not have significant gauge interactions 
and that any v mass is negligible. We use the symbols gy“, to represent the coefficients in 
the four-fermi interaction. These could differ from the coefficients gf, 4 relevant to the Zee 
vertex in more complicated theories involving multiple gauge bosons. Differences can also 
be generated when one includes higher-order radiative corrections in the SM. In practice, 
however, the radiative corrections to elastic ve scattering (Sarantakos et al., 1983) are small 
compared to the experimental precision except for top quark effects. Radiative corrections 
will be discussed in Section 8.3.4. 

We saw in Section 7.2.2 that there is a charged current contribution to Vee™ — vee 
scattering, which can be put in the same form as (8.93) by a Fierz transformation. The 
results in (7.60) and (7.61) (page 238) can therefore be applied to elastic v,e~ and v,e7 
scattering provided we substitute gy.a —> gj%4. (The mey/E, terms are not important 
at accelerator energies.) Elastic y.e~ and Dee™ scattering have both charged and neutral 
current contributions, as shown in Figure 8.7. Formulae (7.60) and (7.61) still apply to those 
reactions provided we take gy,4 = g% a + 1. 

One can determine gyf and g%° separately by measuring either the energy (y) distribu- 
tions or the total cross sections for both v,e and P,e, up to a four-fold ambiguity associated 
with the overall sign and with the interchange of gy“ and g4,°. Of course, the neutrino beams 
are not monochromatic, so the v„ and P, spectra must be modeled. As shown in Figure 8.8, 
one of the four solutions is consistent with the SM for sin? Oy ~ 0.23. If one assumes the 
validity of the SM then sin? 0w can be best determined from the ratio of the vye and De 
cross sections because many of the systematic uncertainties cancel. The precise value of the 
extracted sin? @y depends on the renormalization scheme, as will be discussed in Section 


8.3.4. The value of sin? 0w from Due elastic scattering, in the on-shell scheme defined by 
sin? Oy = 1 — Mĝ,/M3, is 0.2230(77) (Patrignani, 2016). The extracted values of g¥°4 for 
the SM-like solution are compared with the SM expectations, obtained using the SM pa- 
rameters from the global best fit, in Table 8.3. The agreement is excellent. Limits on small 
deviations from the SM value can be used to limit certain types of possible new physics, 
such as additional heavy Z’ gauge bosons or mixing with exotic heavy fermions (see, e.g., 
Amaldi et al., 1987). 

There were also measurements of Dee at the Savannah River reactor and more recently 
by the TEXONO collaboration at the Kuo-Sheng reactor in Taiwan (Deniz et al., 2010), 
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Vu e e Ve Ve e 


Figure 8.7 Left: diagram for v,e~ —> v,e~ in the SU(2) x U(1) model. Middle and 
right: diagrams for vee~ > Vee”. 


and of vee by LSND at Los Alamos. These are not as precise as the v,e measurements, but 
because of the charged current contribution they can help resolve the four-fold ambiguity. 
As is seen in Figure 8.8 two of the four are excluded. The fourth solution, obtained for 
aye + 9°, is excluded by ete~ — uu data under the plausible assumption that the weak 
neutral current is dominated by the exchange of a single Z boson. These experiments also 
demonstrated interference between the charged current and neutral current contributions 


to ‘vee scattering. LSND obtained —1.01(18) for I = 2o! /ao WCC, where o! and a WCC 
are, respectively, the interference and WCC contributions to the cross section, compared 
with the SM expectation of —1.09, while TEXONO found an interference term of —0.92(34) 
compared to the expected —1. (Other observations of WCC-WNC interference are described 
in their papers.) This is significant because it provides a confirmation of the conclusion from 
WNC-electromagnetic interference (Section 8.3.2) that the WNC is really vector and axial. 


TABLE 8.3 Values of the model-independent neutral current parameters compared to 
the SM expectations for the global best fit.“ 


Quantity Experiment SM Expectation 
ita —0.040 + 0.015 —0.040 
g% —0.507 + 0.014 —0.506 
g7, 0.3005 + 0.0028 0.3034 
Tr 0.0329 + 0.0030 0.0302 
0L 2.50 + 0.035 2.46 
OR 4.56+9:32 5.18 
Ciu + 2Cia 0.489 + 0.005 0.495 
2Ciu — Cia —0.708 + 0.016 —0.719 
2C2u — Coa —0.144 + 0.068 —0.095 
Jay 0.0190 + 0.0027 0.0225 


“From the Electroweak Review in (Patrignani, 2016), where one can also find the full SM expressions 
including radiative corrections and the correlations on the experimental uncertainties. 
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Figure 8.8 Allowed regions in g/° and g4° from v e`, Vee™ , and reactor Dee™ elastic 
scattering, compared to the predictions of the SM as a function of sin? Oy in the 
MS scheme. The SM best fit (shaded area) is almost identical to the experimental 
region. The vpe regions are at 90% c.l., and the bands are at lo. Plot courtesy of 
the Particle Data Group (Patrignani, 2016). 


Deep Inelastic Neutrino Scattering 


There have been many experiments measuring neutrino hadron scattering, including elas- 
tic Vap > Vap scattering, inelastic exclusive processes such as v,p —> v,7N (dominated 
by the A resonance), and deep inelastic v,N — v,X scattering, as well as their p, and 
charged current analogs (for reviews, see F. Perrier in Langacker, 1995; Conrad et al., 1998; 
Formaggio and Zeller, 2012; Patrignani, 2016). Deep inelastic ef p — e+ X (involving both 
y and Z exchange) and etp > ve(De)X (via WF) have also been studied at the HERA 
ep collider at DESY (Abramowicz et al., 2015, 2016). The WCC experiments have been 
especially important as tests of QCD and probes of parton distribution functions, provid- 
ing information complementary to charged lepton scattering. They also constrain the CKM 
matrix elements |V-q| and |V..|. The neutral current experiments have mainly functioned as 
tests of the WNC couplings and determinations of parameters such as sin? Oy. 

Here we will focus on the deep inelastic neutrino scattering, for which many experiments 
have been performed, especially at CERN and Fermilab. The neutrinos are mainly produced 
by meg and Ke2, where the m and K emerge from proton collisions on a target. These are 
almost entirely v, and D, though enough ve(De) are produced, mainly in Ke3 decays, that 
these must be taken into account as background. More wt and Kt are produced than 
a and K~, so v, beams are more intense than y,. The typical neutrino energies at the 
CERN and Fermilab experiments are in the 10’s-100’s GeV range with a broad spectrum. 
The spectrum becomes somewhat narrower in energy in the narrow band beams (NBB), 
in which the energy of the parent hadron is selected. These actually lead to two peaks in 
energy, associated with the pion (lower energy) and kaon decays. Even in the NBB there 
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is considerable uncertainty in the incident neutrino energy. E, can be determined in WCC 
events such as v, N — u` X by measuring the outgoing ~~ and hadron momenta, but this 
is not possible in WNC current events such as v,N — v,X because of the unobserved 
final neutrino. One therefore typically concentrates on the total WNC cross sections. Care 
is still required to determine or model the incident energy spectrum and average over it 
appropriately. 

We will consider the charged and neutral current processes 


WCC: y,N> uw X, DN > ptx 


_ _ (8.95) 
WNC: vaN > vX, DN > v,X, 


where N can be a proton, neutron, or nucleus. The nuclear targets, especially heavy ones, 
yield more events. They are also easier to interpret theoretically, especially if they are 
isoscalar (i.e., equal numbers of p and n), such as '*C, or close to isoscalar, such as °° Fe. 
The p and n-target experiments, performed, e.g., by observing the relevant tracks in a 
deuterium or neon bubble chamber such as BEBC at CERN, have lower statistics but 
provide information on the isospin structure of the WNC or of the parton distributions. 


Deep Inelastic Charged Current Scattering 


The kinematics for deep inelastic neutrino scattering are the same as those for e~ scattering, 
as described in Section 5.5. The calculation of the cross section is also similar. In the relevant 
regime Q? < Mj, the WCC cross section is 

di yz Gh k' 


IKA T 32r? Rw (5B 


where 
L = Tr h"a- 75) Ka-5) K] 


=g k” k” +4 ke k” +4 mw? jePPYO kl k (8.97) 
= g J +€ pKo 

is the leptonic tensor, analogous to the one for e~ scattering in (5.46) on page 175 with 
me neglected. (Note that the symbol v enters several ways: as a Lorentz index, to indicate 
a neutrino, and as the invariant p- q/M. The meaning should be clear from the context.) 
The antisymmetric e“?”° is due to the parity-violating vector-axial interference. The tensor 
LEY for DaN — yt X is obtained by interchanging k and k’, so that the e“?”" term changes 
sign. Similarly, the hadronic tensor WY’ is defined as in (5.55), except Jọ is replaced by the 


Hv 
. TE h D . : . 
hadronic charge-raising current pe . Wy’ can be written in terms of the structure functions 


W73 in a form similar to (5.58): 


Wry = Fz + a wy” (Q?,v) 


1 
M2 


(8.98) 


pq p-q P ; p’q i 
[on — Patan] [pv - P a| WP (02.0) + ienero aga WE” (2), 


where the final term is again due to vector-axial interference.!? The superscripts on the 
structure functions W1,2 indicate that those for e~,v, and Ð are all different, while Wy can 


13Strictly speaking, there are additional possible terms in the hadronic tensor since Jt, is not exactly 
conserved. However, they lead to effects proportional to the lepton masses and are therefore negligible in 
the deep inelastic regime. 
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differ from WY. They also depend on the target (p, n, or N). However, to the extent that 
one can ignore the contributions of the second and third families and of Cabibbo or CKM 
mixing, one has from isospin that 


v D v D v 1 v v D 


ip? 
where N is an isoscalar target. In analogy with e~ DIS, we introduce new structure functions 
FY? (e, Q?) = MWE? (QP), FEE (EQ) =W (Qa), 8100) 


anticipating that the quark model or QCD will imply an approximate scaling behavior 
FY" (x,Q°) ~ F7” (x), up to higher-order corrections. In that case, the differential cross 


section is 


aay G? ME, Viv Viv y Vv 
= ZEZ [ey FY? (a) + (Ly) FY (2) + ayl- FY”), (8.101) 
dxdy T 2 
where the + is for v(D), and the total spin-averaged cross section 
1 1 2 =v, 
5 d og 
a = dz | dy—< (8.102) 
| 0 dady 


is predicted to scale with the neutrino energy in the lab frame, Ep. 

Let us now consider the simple quark parton model approximation, as indicated in 
Figure 8.9, and initially ignore the heavy families and quark mixing. Then, similarly to 
(5.70) for e~ scattering, one has 


FY (x) = 2aF f(x) = 22 [d(ax) 


[d(x) — u(x) 
FS (a) = 2xF7 (x) = 22 [u(x) 1 


2 


where Fa = 2xF is just the Callan-Gross relation, i.e., that the partons have spin-5, and 
the — sign for the antiquarks in F} is due to V — A. Comparing with the analogous relations 
for e~ scattering in (5.70), we see that v DIS allows one to separate q from q by measuring 
F> and F; separately. Equation (8.103) is easily extended to include three families and 
CKM mixing. However, for the energies and precision of the existing experiments it suffices 
to consider just the first two families, since the b and t content of the nucleon is small. Using 


the Cabibbo approximation to the mixing, 
F% (x) =2k(x) d(x) (cos? 6. + sin? Oc&e) 


8.104 
+ s(x) (sin? 0, + cos? 0,€.) + u(x) + e(zx)], ( ) 


where the terms correspond, respectively, to d > u, d > c,s > u,s > c,ū > (d + 5), and 
č — (d+ 5). The + signs are for F(F3), while «(x) = a for Fy and 1 for F3. €.(Q?) is a 
kinematic suppression factor associated with the non-zero c quark mass (we ignore Mu,a,s). 
One expects ĉe — 1 for Q? >> m2, while the small deviations from unity for finite Q? can 
be extracted from the c production data or estimated theoretically. Similarly, 


F} (£) =2«(x)[u(x) + e(x) + d(x) (cos? 0e + sin? 0.£.) 


8.105 
ae s(x) (sin? Oe + cos? Oc€c) | ; ( ) 
To include the full CKM matrix, one can replace 
(cos? 0. + sin? Oege) > (|Vual? + Veal £e) (8.106) 
(sin? 0, + cos? befe) + (|Vas|? + |Ves| Ec) - l 


The Standard Electroweak Theory 285 


Vu H Vu H Vu Vu 
TR ae oe ae = 
5 ee 5 
4h u. wr d_. Z t, a 
df in qq, 
fo Poo f 
e X eS X eer xX 
P` P` P` 


Figure8.9 Typical diagrams for neutrino deep inelastic scattering in the quark parton 
model. The initial and final lepton momenta are k and k’, and q = k — K’. 


(In principle, the coefficient of u(x) in F73 is changed to |Vua|? + |Vus|?, etc., but in practice 
the difference is negligible.) 

An interesting test of the quark model and charge assignments is the 5/18*” rule, which 
concerns the ratio of F> in e~ DIS from an isoscalar target N (Equation 5.79), to Py. In 
the simplified model in which one ignores the s and c quarks in the nucleon and sets ée ~ 1 
one predicts 

Fay _ (Fp + Fn) _ (G+) iowa sored) _ 5 


2 = a = 8.107 
Fy 4} (Fy, + FY,) x (d? + uP + uP + dP) 18’ ( ) 


where q? is the quark distribution function in the proton and we have used the isospin 
relation (5.83). Equation (8.107) agrees well with the data, but not with the expectation 
~ 0.5 of an early competing model with integer charged quarks (Nambu and Han, 1974, 
and Problem 5.1). 

The parton model expressions for the WCC cross sections are 


dor, 2GpME, 
drdy 


{ad [Vaa]? + [Vea PE] +2 [|Vusl? + [Vesl*&e] +2 (a +e) 0-a} 
-__——_" -—_— 
Xa As 


2G ey E (dt+s)+2(a+a(1 -y)] (8.108) 


and 


Pa’, 2G2.ME, J 

L Te Baii E (u +6) (1— y)? +adda + 73).| 

xdy = We (8.109) 
pepsi  [e(ute) (1 y)? +ad + a3], 


T 
where the target labels are not displayed. The second form is valid at sufficiently high energy 
that the c-quark threshold effects are negligible, and ignores the small CKM mixing with the 
third family. One sees explicitly that the total cross sections are predicted to grow linearly 
with energy to the extent that one can ignore the Q? dependence of the quark distribution 
functions, W propagator effects, £e, etc. One sees in Figure 8.10 that this is the case. The 
v, and P, data can be combined with e~ DIS to separate the various quark distributions. 
Of course, QCD predicts logarithmic violation of scaling, which is tested in detail (for a 
review, see Conrad et al., 1998). The observed structure functions are also an input to the 
global QCD fits to parton distribution functions described in Section 5.5. 
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Figure 8.10 Total charged current neutrino cross sections 5%” / E, vs E,,. The simple 
parton model or QCD predict that these should be ~ constant at high energy. Plot 
courtesy of the Particle Data Group (Patrignani, 2016). 


Neutrino induced opposite sign dimuon production is associated with the production 
of charm (c) quarks from d or s, via v(v)N > u? X + c(é), with the subsequent decays 
c —> (s,d)u*v, or €> (3,d)u-v,. By measuring these dimuons for both v, and 7, and in 
different F, ranges, one can extract |Voq|?,S|Ves|?, and £. separately from the experimental 
data, where S' is the momentum carried by s quarks, S = f ws(a)dx. 


Deep Inelastic Neutral Current Scattering 


The neutral current processes v,,(V,,)N — v,,(¥,,)X have been especially useful in testing 
the WNC predictions of the standard SU(2) x U(1) model. They were extremely important 
in establishing the correctness of those predictions to leading order and in limiting small 
deviations. Prior to the Z-pole experiments at LEP and SLC they provided the most precise 
measurements of sin? Oy. 

From (8.75) and (8.79) the effective four-fermi interaction for WNC v-hadron scattering 
in the SM is 


= i -5 Dyt (1-7) 
(8.110) 
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where!* from (8.76) 


1 2 2 
ey (u) = +5 -3 sin? Oy, ef (u) = -3 sin? Ow 

i 4 l (8.111) 
e(d) = | + 3 sin? Ow, eh (d) = 3 sin? Ow. 


Just as for ve scattering, (8.110) will continue to hold in an arbitrary gauge theory with 
neutrinos coupling to V — A, or can be extended to include radiative corrections to the SM 
tree-level predictions (Marciano and Sirlin, 1980; Sirlin and Marciano, 1981), for appropri- 
ate values of the etn We therefore allow for arbitrary ef n(u, d), but will assume family 
universality, i.e., that €7"p(d) = e{”r(s) and e4"p(u) = ef"p(c). 

It is then straightforward to show that in the simple parton model, 


p ES 
d Tke _ 


02 ME, 
dxdy T 
+ [EDP + DPA- y)’] (wd + zs) 


| 
+ {lei (u)[? + lef (uw)? (1 = 9)? | (wii + wee.) 
| 


+ [IERD] + lez"? A — y)"] (wd +28), 


[et + let (u)[?  — y)?] (wu + ae.) 


(8.112) 


while €4"(r) + e¥? (r) for D. 

Now, consider WCC and WNC scattering from an isoscalar target, and ignore the s and 
c content of the nucleon and take £. ~ 1. (One must correct for these in the actual analysis.) 
Then, from (8.108), (8.109), and (8.112), the total cross sections are 


2G2.ME, f! 1 _ 9G2ME; ftrl 
D= rf [za + zaq] dz, a. = | [zza + wal dx 
T 0 3 T 0 3 
2G}ME, f! 1 1 
Tre = oe f (st [za t va] GAl xq 4 va)) dx (8.113) 
. 2GZMEp f! 1 1 
Õre = eit f (o [zza + ra] + op [za + va]) dx, 


where q = (u? + d?)/2 and q = (u? + d?)/2. The effective L and R couplings g} r are 


1 5 
g? = lef" (u) |? + ltd)? = zm sin? Oy + 9 sin’ Ow 


(8.114) 


V vV 5 . 
IR = jeg (u)? T eR (d)|? = z siní Ow, 


where the expressions on the right are those for the tree-level standard model. 

It is useful to consider the ratios of WNC and WCC cross sections, because many of 
the uncertainties associated with the strong interactions, neutrino fluxes, and systematics 
cancel in the ratios. Under the same approximations as (8.113) one finds 


a =f g 
Re=sew~ gi ton, Ro = Ze ~ gi 4 = (8.115) 
cc cc 


M4Similar to gý a, we use the symbols eh (r) to indicate that they are the coefficients in the four-fermi 
interaction, and can differ from the Zqq vertex factors in (8.75) by higher-order corrections or new physics. 
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where A 
a”, $e = Jo tax) dx 


Ce 1+5 o J xq(x)dx 


can be measured directly. € is the ratio of the part of the nucleon’s momentum carried 
by antiquarks to the part carried by quarks. One would have r = z for q/q = 0, but the 
observed value r ~ 0.44 corresponds to e ~ 0.125. It is remarkable that within the stated 
approximations the details of the quark distributions cancel out, except for r, which can 
be measured directly. Although this was shown here within the simple parton model, the 
result essentially follows from isospin symmetry alone, except for a term involving Y? that 
is weighted by a small coefficient of sin* Ow (Llewellyn Smith, 1983). 

The cross section ratios have been measured to 1% or better by the CDHS and CHARM 
collaborations at CERN and by CCFR at Fermilab during the 1980s and 1990s. Although 
(8.115) gives a good first approximation, in practice one must correct for nonisoscalar target 
effects (Nn # N, in °° Fe); s(x) and c(x); the c threshold, €.; third family mixing; the W 
and Z propagators; radiative corrections; experimental cuts; and the QCD evolution of 
the quark distributions. Since these are relatively small corrections for the *C and 56 Fe 
experiments, it suffices to use the QCD improved parton model described in Section 5.5 to 
estimate them. The same estimates can be used for the n and p target data, for which the 
isospin argument does not apply: even though the theoretical uncertainties are larger for 
such targets, the model is adequate since the experiments are less precise. 

The isoscalar target experiments yielded the most precise determinations of sin? Oy prior 
to the Z-pole era, sin? 0w = 0.233(3)(5), where the first error is experimental. The second 
error is theoretical and is largely due to the uncertainties in the charm quark threshold 
effect e. Even though c production represents less than 10% of the WCC cross section and 
Ee can be determined to ~ 15% from dimuon data (page 286), the remaining uncertainty 
dominated the error. The theoretical uncertainties can be minimized by using the Paschos- 
Wolfenstein ratio (Paschos and Wolfenstein, 1973), 


Il 
| 


(8.116) 


oV. — T jl 
R` = 74 ne agg? aw sin? 6 8.117 
ay — a, JLT IR 5) n Ow, ( ) 


for an isoscalar target, which follows from isospin. (There is still a small residual uncertainty 
from the charm threshold, but it is suppressed by sin? 64.) Utilizing RT requires a high 
intensity and high energy P, beam, which became possible in 1996 using the sign-selected 
beam at Fermilab. 

The NuTeV collaboration (Zeller et al., 2002) utilized the sign-selected beam and a 
version of (8.117) to obtain a more precise value sin? 0w = 0.2277(16), which is insensitive 
to the top quark and Higgs masses and has little uncertainty from the c threshold. However, 
this value is ~ 30 above the current value of sin? Oy = 0.2234(1) from the global fit to all 
data (Patrignani, 2016). The discrepancy increased by about lo due to remeasurements of 


the Ke3 branching ratio (Section 8.6.1), which affects the D contamination of the D, beams, 
but this was roughly compensated by NuTeV’s subsequent measurement of an asymmetry in 
J dz x|s(x) — 5(x)] (Mason et al., 2007). The origin of the discrepancy is not understood. It 
could be an indication of some sort of new physics, such as contributions from a Z’ (Davidson 
et al., 2002). However, it could also be associated with additional subtle QCD effects, such as 
larger than expected isospin breaking, nuclear shadowing, or NLO or electroweak radiative 
corrections. It is hard to draw any conclusions until there is a full global analysis of these 
issues. 

The neutrino-hadron data can be used in a global analysis to determine ep" n(u, d). The 
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results are shown in Figure 8.11 and Table 8.3 (where 0z R = tan™"[eZ"e(u)/e4"p(d))). It 
is seen that the deep inelastic isoscalar data determine g? and gẹ quite well. The isospin 
structure, which depends on DIS from p and n targets and other reactions, is well-determined 
for the L couplings, but only poorly for the ey. Nevertheless, the v-hadron data is consistent 
with the SM for sin? Ow ~ 0.23. 
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Figure8.11 90% c.l. allowed regions in e¥”p(u) vs e{”p(d) from i, N elastic, exclusive 
inelastic, and deep inelastic scattering, compared to the predictions of the SM as a 
function of sin? Oy, updated from (Amaldi et al., 1987). 


8.3.2 Weak-Electromagnetic Interference 


Following the discovery of the weak neutral current in the early 1970s, it took some years 
to establish that it was uniquely consistent with the predictions of the SU(2) x U(1) model. 
As described in Section 8.3.1, ve and v-hadron scattering eventually zeroed in on the SM 
predictions when analyzed in a general V, A framework. However, it was conceivable that 
the WNC could be due to S, P, and T couplings, e.g., due to the exchange of spin-0 or 
2 particles. These all involve a helicity flip of an initial left-handed neutrino (or right- 
handed 7) produced in a charged current decay, into a neutrino of the opposite helicity 
(Equations 2.213 and 2.214 on page 42). However, it is not feasible to measure the final 
helicity. In fact, there is a confusion theorem that the final electron or hadron distributions 
in v,e~ elastic scattering or deep inelastic v, N scattering predicted by any V, A theory can 
be duplicated by some combination of S, P, and T (Kayser et al., 1974). 

The easiest way to resolve this ambiguity was to observe the interference between WNC 
amplitudes and those involving other interactions known to be V and/or A in character, 
which could not occur for S, P, or T at high energies where fermion masses can be ignored. 
In fact, some interference between WNC and WCC amplitudes was eventually observed, 
such as in y.e~ and @e~ elastic scattering. However, the more precise tests (and also the 
earliest chronologically) involved interference between Z exchange and electromagnetism in 
eq, ete, and (more recently) in e~e~ interactions. At low energies, such as in atoms, the 
WNC is only a tiny perturbation on the Coulomb or other electromagnetic effects. However, 
QED is purely vector and parity conserving, so observation of V A interference is a clear sign 
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of the WNC. (These are usually parity violating, but in some cases can be parity conserving 
if they involve the product of two axial currents from the WNC.) At higher energies, the 7 
and Z exchange amplitudes may be comparable and Z propagator effects may be observable 
as well. There have by now been many observations of WNC-electromagnetic interference, 
all of which are in agreement with the predictions of the standard model. They include: 


Polarization (or charge) asymmetries in deep inelastic eD — eX (SLAC and Jeffer- 
son Lab), wC — uX (CERN), and e*p — e+X (DESY); in low energy elastic or 
quasi-elastic polarized electron scattering at Bates, Mainz, and Jefferson Lab; and in 
polarized e~e~ Moller scattering (SLAC). 


Atomic parity violation in cesium (Boulder, Paris), thallium, and other atoms. 


Cross sections and forward-backward asymmetries in e+e~ — L4, qq, cé and bb at PEP, 
PETRA, TRISTAN, and LEP 2. 


Forward-backward asymmetries in ‘pp — ete” and utu” at the Tevatron and LHC. 


Parity Violating ¢*-Hadron Interactions 
The parity-violating part of the effective WNC eq Lagrangian density is 


G s ee, fe 
—£4 = a5 > o [Cu Bre Gras + Cuk ye Gra]; (8.118) 


i=u,d 


where the tree-level expressions in the SM are 


Oru = —5 + $ sin? Ow, Coy = 5 + 2 sin? Ow 

in he 1 ; (8.119) 

Cid = 3 3° Ow, Coq = geen Ow. 

Early attempts to measure the effects of WNC-electromagnetic interference in atoms 

were unsuccessful, leading to considerable confusion and doubts concerning the correctness 

of the SU(2) x U(1) model. However, those results turned out to be erroneous. The first 

significant interference observation was in the SLAC measurement (Prescott et al., 1979) of 

the parity-violating polarization asymmetry Apy = (or — o L)/(orR +0_) for deep inelastic 

e` scattering on deuterium, where op, refer, respectively, to the cross section for a right- 
or left-handed e~. One expects 


Apv 1—(1—y)? 
= 12 
Q? ee TE (8 0) 
where Q? and y are the usual kinematic variables for DIS, and 
y 
3G F 1 3G F 3 5.92 
= i C > + — sin* 0 
a 5/270 ( 2 12) SM 5/270 ( 4 3 w) (8 121) 
pees (c Ca) Pu (sin? =) | 
i 52ra EI A 52ra KA 


They observed an asymmetry, consistent with the SM expectations for sin? 0w = 0.224(20), 
agreeing with the neutrino experiments and resolving the confusion. Subsequent measure- 
ments of polarization asymmetries in u* DIS confirmed the result. 
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More recently, there have been a number of low energy parity violating electron scat- 
tering (PVES), i.e., polarization asymmetry, experiments at Jefferson Lab and elsewhere. 
These include a new eD DIS experiment, PVDIS (Wang et al., 2014), improving on the 
SLAC results, and an elastic ep asymmetry experiment Qweax (Androic et al., 2013) that 
yielded the first measurement of the weak charge of the proton, Qiy = —2(2Ci, + Cia). 
These have constrained Ciu,a to a small region consistent with the SM, as can be seen in 
Table 8.3 and Figure 8.12. These results and future experiments are reviewed in (Kumar 
et al., 2013; Erler et al., 2014), and the implications for new physics in (Erler et al., 2003; 
Cirigliano and Ramsey-Musolf, 2013). 


Atomic Parity Violation 


The parity-violating WNC can induce mixing between S and P wave states in atoms, 
leading to such effects as the rotation of the polarization plane of linearly polarized light 
as it passes through an atomic vapor, or differences in transition rates induced by left- 
and right-circularly polarized photons. Observations have been made in cesium, bismuth, 
lead, thallium, and ytterbium (for reviews, see Ginges and Flambaum, 2004; Roberts et al., 
2015; Patrignani, 2016). The most precise have been measurements of 6S — 7S transitions 
in cesium in Paris and Boulder, with the most recent Boulder results at the 0.4% level. 
Furthermore, cesium has a single valence electron outside a tightly bound core, allowing 
accurate calculation of the matrix elements needed to interpret the results. 

The effective interaction in (8.118) leads to a non-relativistic potential for the e~ in an 
atom Z, N: 


GF 
ee S (Fo), Fe «Bol, 8.122 
IAOW PE) de Fe} (8.122) 


where pe is the electron momentum operator and Qw is the weak charge 
Qw = —2[Ciu (22 + N) + Cia(Z + 2N)| ~ Z(1 — 4 sin? 0w) — N (8.123) 


(see Problem 8.11). In (8.123) we have kept only the Ciu,a terms. This involves an axial e7 
current and vector hadronic current, with the latter adding coherently over the nucleons 
in a heavy atom because it is spin independent. In fact, the effects of this term scale as 
Z3. One factor is obvious from the coherence effect in Qw, while the others involve the 
electron wave function and momentum near the nucleus. There are no such enhancements 
for the C2,,q (nucleon spin-dependent) terms, which require unpaired nucleons,!° so the 
Ciu,a strongly dominate. 

As mentioned, Qw has been measured in cesium at the 0.4% level. Neither the theoretical 
expression in (8.122) nor the treatment of the atom as hydrogen-like are adequate for 
such precision. In practice, one must correct for the finite nuclear size, treat the electron 
relativistically using the Dirac equation, carry out a many-body Hartree-Fock calculation of 
the wave functions, and include both electroweak and QED radiative corrections (Ginges and 
Flambaum, 2004; Roberts et al., 2015; Patrignani, 2016). The latter are difficult due to the 
large nuclear charge, but careful treatments have now yielded a precision of ~ 0.5%, allowing 
a determination Qw (Cs) = —72.62(43), in reasonable agreement with the SM expectation 
—73.25(2) (Patrignani, 2016). The corresponding weak angle (in the MS scheme near p = 0) 
is sin? Oy = 0.2358(20). 

Experiments in hydrogen or deuterium would be extremely clean theoretically, but the 


15Nevertheless, the small spin-dependent effects can be separated by measuring different hyperfine tran- 
sitions. They have been observed in cesium, but are dominated not by the WNC but by the larger electro- 
magnetic anapole moments mentioned in Section 2.12.4. 
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effects are much smaller. To date no atomic measurements have been completed, though 
Qfy = 0.064(12) has been measured in PVES (Androic et al., 2013), to be compared with 
the SM value 0.0708(3). Other future possibilities include measurements of ratios of effects 
in different isotopes of the same atom (for which many theoretical uncertainties cancel), or 
on other atoms or ions with significantly enhanced parity-violating effects. 

The results from polarized lepton asymmetries and atomic parity violation can be com- 
bined in a global analysis. The combinations Ciu + Cia and 2C'2, — Coq are now very well 
determined and in agreement with the SM, as can be seen in Table 8.3 and Figure 8.12. 
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Figure 8.12 Allowed regions in Ciu — Cig vs Ciu + Cig from atomic parity violation 
(APV), parity-violating electron scattering (PVES), and the global combined anal- 
ysis at 68% and 95% c.l. The SM prediction as a function of sin? 0w in the MS 
scheme and the global best fit prediction (Patrignani, 2016) are indicated. Plot 
reproduced with permission from (Androic et al., 2013). 


Polarized e~ e~ scattering 


The parity-violating ee interaction 


-L1 = -5 giv Eye Eye, (8.124) 
where gjy = ł — 2 sin? Ow (~ —Qfy/2) at tree level in the SM, leads to a polarization 
asymmetry Apy in Møller (e~e~ — e~e7) scattering of electrons on an atomic target. 
Apy has been measured by the E158 collaboration at SLAC at low Q? ~ 0.026 GeV? using 
a nearly 90% polarized beam (Anthony et al., 2005). The process is dominated by t and 
u-channel y exchange, with Apy generated by interference with Z exchange. Their mea- 
sured asymmetry corresponds (Erler and Su, 2013) to g% = 0.0190(27), compared with the 
SM value 0.0225. Moreover, it was the first clear confirmation of the SM prediction (Czar- 
necki and Marciano, 2000) for the running of the weak angle, to be discussed in Section 
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8.3.4. A more precise measurement of Apy is underway at Jefferson Lab by the MOLLER 
collaboration. 


e-e+ — ff Below the Z 


There have been many measurements of e~e+ —> ff, where f = e, u, T, b,c, or q (i.e., an 
unidentified quark flavor), below the Z pole at SLAC (SPEAR, PEP), DESY (DORIS, 
PETRA) and KEK (TRISTAN), as well as measurements at or near the Z-pole (LEP, 
SLC), and above it (LEP 2). The PEP and PETRA measurements (for reviews, see (Wu, 
1984; Kiesling, 1988) and D. Haidt in (Langacker, 1995)) were able to observe effects of the 
s-channel Z exchange as a perturbation on the dominant one photon contribution, but were 
at low enough energy that they could be treated as a four-fermi interaction. TRISTAN was 
at a higher energy, where the virtual Z propagator effects were non-negligible (e.g., Mori 
et al., 1989). The principal observables relevant to the Z were the total cross sections and 
the forward-backward asymmetries 


Sg ids, Appa 8.125 
o =0F +B = ae ( ) 
where i P 
do do 
= d 0 = d 0 8.126 
p | deos 59 wa I. deos 99 ( ) 


and @ is the angle between the e~ and f in the center of mass. It is difficult to present the 
results in a model independent form because of the Z propagator. Instead, we will write 
the tree-level formulae (for arbitrary s) assuming that only the s-channel photon and Z 
exchanges are relevant, but allow arbitrary Z couplings. (t-channel contributions must be 
included for Bhabha scattering, e~e* — e~ eT). 

Polarizations and forward-backward asymmetries for massless fermions are easily calcu- 
lated using straightforward generalizations of the helicity amplitudes in (2.249) on page 49. 


Suppose the amplitude for e` (p1) e* (p2) > (3) f(p4) is 


M = — ieLL ü3Yu PLV, bay" Pru — terre tis YpPRvs boy" Pru (8.127) 
— ieLR üY Prv Voy" Pru — ter üz Yu Prva Voy" Prun, 


where the first (second) subscript on the e is the chirality of the f(e~). In the massless 
limit the four terms in (8.127) do not interfere with each other, and coincide, respectively, 
with the helicity amplitudes M(—+,—+), M (+—,+—), M(—+,+-—), and M (+-,-+) 
defined below (2.239). Reflection invariance only holds in the special case en, = Err and 
ELR = ERL. Nevertheless, we can use (2.249) to obtain 


M (—+,—-+) = icerz s (1+ cos), M (+—,+-—) = ierrs (1 + cos0) 


8.128 
M (—+,+-—) = ieLrRs (1 — cos0), M (+—,—+) = ierz s (1 — cos0), ( ) 


from which any cross section or polarization effect can be calculated. For example, the 
spin-average cross section and forward-backward asymmetry are 


a= f D = = (een? kal Hean Fe (8.129) 
—1 3278 48T Bi RR ae R ` 
a 3 (lex? + lerr|? — lerr? — lerz|* 
A = R 8.130 
E 4 (ee ee eee) ( ) 
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Assuming that only the s-channel y and Z exchange are relevant, the coefficients in 
(8.127) are given by 


cap = WE — 4V8GpMED(s)ea (fee) (8.131) 


where A,B = L or R. The ez, r(r) are the chiral couplings of the Z to fermion r. They 
are defined in (8.75) and their SM values given in (8.76). D(s) = (s — M3 + iMzV'z) ` is 
the Breit-Wigner form for the propagator of an unstable Z of width Iz, as discussed in 
Appendix F. It is introduced here so that the expression is valid below, near, and above the 
Z pole. Equation (8.131) can be rewritten by factoring out the pure QED piece 


a (conta tien bel eat Hess (8.132) 


where 
Gr sMzZ Mzľz 


; tanôr = — 
22ra [(M2 — s5)? + M3r3)'”? R M2 = 
cos dp — 1 at low energies for [z/Mz « 1, while cosép — —1 for s > M3. At s = M3 


we have that cosdr — 0 and the interference term vanishes. Substituting into (8.129) and 
(8.130), we find 


Xo = (8.133) 


sa. Apg = 3Fz/4F,, (8.134) 
where 09 = 4nQa? /3s is the QED cross section in (2.232), and 


2X0 e e 
F =1+ Oe gi gy cosôr + Qz ® (af + gf?) (9%? + 97) 
" i (8.135) 
x6 f 
Fa = +2 gf g% cosôr + 2 gh 9% of 96 
Qf AJA Q? AJA IV IV: 


We have used that ez (r) + er(r) = gọy,4 and therefore ef (r) — eR(r) = gp gh. For s « Mz, 
Xo = O(s/MZ) « 1 and ôr ~ Tz/Mz « 1, so only the first term in F> is relevant. This 
involves only the axial couplings, which in the SM are independent of sin? 0w. The leading 
contribution to Arg is therefore an absolute prediction. For f = u~ the SM couplings are 
pir ies. =. 1 fb = pÊ — 1 + 2 

ga = 9a =~ 5: fy = ay =~, +2 sin Ow. (8.136) 

The observed asymmetry, shown in Figure 8.13, agrees with the SM prediction. 
Asymmetries for f = 7~ and f = b also agreed well, and were important in establishing 
the canonical doublet assignments of the r; and bz and singlet assignments of Tp and 
br (i.e., that the third family is sequential), prior to the direct discovery of the v, and t. 
This was quite important, because the third family could well have been different from the 
first two. As an example, the JADE collaboration at PETRA obtained the first significant 
measurement a the bb asymmetry, at \/s = 35 GeV (Bartel et al., 1984), from which one 
can extract! gł} = —0.54+ 0.15. In the context of SU(2) x U(1), but allowing arbitrary 
assignment of the bz and bg to SU(2) multiplets, one expects g} = tè, — tîr, where 
tip are their T° eigenvalues. The JADE value is consistent with the SM assignment 


16Subsequent LEP experiments allowed a much more precise determination of both ge 4 (Schaile and 
Zerwas, 1992). 
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Figure 8.13 Experimental results on the forward-back asymmetry in e 


t}; = —1/2, tę = 0, but not with several alternative models for the third generation, such 
as the anomaly-free models listed in Table 8.4. These include a mirror family (Maalampi 
and Roos, 1990), involving L-singlets and R-doublets, and non-chiral (vector) models (e.g., 
Frampton et al., 2000). The four models in Table 8.4 predict gł = —1/2,+1/2,0, and 0, 
respectively, so only the sequential model was viable. In particular, the “topless” singlet 
vector model was excluded, strongly suggesting that the the top quark had to exist (barring 
extremely exotic alternatives). A similar result was also obtained by the absence of flavor 
changing neutral current decays B — ¢*¢~X by the CLEO collaboration at CESR, which 
would have been generated by the violation of the GIM mechanism if the bz, rR were singlets 
that decayed by bz —dz or by —sz mixing (Kane and Peskin, 1982). Similarly, the 7 couplings 
were obtained by observations of the 7 lifetime, decay distributions, FB asymmetry, and 
absence of flavor changing neutral current decays T > Ll, requiring the existence of the 
vr. In fact, the canonical assignments for all of the known fermions could be extracted from 
the data (Langacker, 1989b). 

A forward-backward asymmetry does not by itself require parity violation, and in fact 
a non-trivial asymmetry is generated by higher-order QED corrections that must be taken 
into account in the analysis. As an example, the effective interaction in (8.127) would be 
reflection invariant for €L = €rr and eR = ERL = 0, though that would be difficult to 
achieve in a simple gauge theory. 

The y— Z interference effects in e~ e+ — ff have also been observed above the Z-pole at 
LEP 2 (Schael et al., 2013). The results are in agreement with the SM. Forward-backward 
asymmetries in the inverse processes, pp,pp > L6, L = e or u, will be mentioned in 
Section 8.3.5. 
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TABLE 8.4 Anomaly-free SU(2) assignments for the third family.” 


Sequential family ( bi ) ( ) TR tR bR 
T L L 
i : 2 Vr t 
Mirror family Tr tL bL ( = ) ( b ) 
T R R 
Singlet vector Tr br TR br (topless) 
Doublet vector ( Hi ) ( ) ( T ) ( ; ) 
T JF É T JR R 


"Singlets vrr or (mirror) vrg could be added. The leptons could be interchanged in the two vector 
models, e.g., vector doublet quarks and vector singlet 7. More complicated assignments are also possible. 


8.3.3 Implications of the WNC Experiments 


Even before the era of the high precision Z-pole experiments at LEP and SLC began in 
1989, the weak neutral current experiments and observation of the W and Z had done much 
to establish the standard electroweak model. Global analyses of the data were essential to 
the program, especially in constraining alternatives or extensions of the SM, because no 
one type of experiment was sensitive to all of the possible parameters. They also allowed a 
unified theoretical treatment of similar experiments. The caveat is that one must be careful 
with the treatment of systematic and theoretical uncertainties and in correlations between 
uncertainties. 
Major results included: 


Model-independent fits showed that the four-fermion interactions relevant to vq, ve, 
and eq were for the most part uniquely determined. They were consistent with the 
SU(2) x U(1) SM to first approximation, eliminating many competing gauge theo- 
ries and alternatives involving spin-0 or 2 exchange, and limiting small deviations. 
Measurements of My and Mz agreed with the expectations of the SU(2) x U(1) 
gauge group and canonical Higgs mechanism, eliminating more complicated alterna- 
tive models with the same four-fermi interactions. 


The combination of WCC and WNC results allowed the SU(2) x U(1) representa- 
tions for all of the known fermions to be uniquely determined, i.e., that all of the fz 
transformed as SU(2) doublets and the fr as singlets. That required that the t and 
v, had to exist, as partners of the b and T~. 


The precision of the deep inelastic neutrino data was high enough to require the 
application of QCD-evolved structure functions. 


Both WNC and WCC processes were measured precisely enough to require QED and 
electroweak radiative corrections in their interpretation, and attention had to be paid 
to the definition of the renormalized sin? 0w, as will be discussed in Section 8.3.4. 


sin? Ow = 0.230 + 0.007 (using the on-shell definition), and m; < 200 GeV from the 
electroweak radiative corrections. 


The SU(3) x SU(2) xU(1) couplings were determined well enough to exclude the gauge 
coupling constant unification predictions of the simplest form of non-supersymmetric 
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grand unified theories (GUTs). However, they were consistent with the predictions of 
the simplest supersymmetric GUTs, as will be discussed in Section 10.2.6. 


Significant limits could be placed on many types of new physics that could perturb the 
SM predictions, including a heavy Z’, new fermions with exotic quantum numbers, 
exotic Higgs representations, leptoquarks, and many types of new four-fermi operators 
generated by other types of underlying new physics. In many cases both WNC and 
WCC constraints were critical. 


8.3.4 Precision Tests of the Standard Model 


The precision electroweak program entered a new phase in the late 1980s with the advent 
of the Z-pole experiments at LEP (CERN) and SLC (SLAC), which eventually allowed 
tests of the SM at the 0.1% level. With such precision, care was needed in the application 
of QED, electroweak, QCD, and mixed radiative corrections, and in the definition of the 
renormalized weak angle sin? 0w. The radiative corrections were sensitive to the top quark 
and Higgs masses and to the QCD coupling as, and therefore allowed these quantities to 
be constrained or predicted by the data. Here, we briefly survey some of the relevant issues 
(which were also necessary, though less critical, for the WNC experiments described in 
Sections 8.3.1 and 8.3.2). Most of the numerical results are from the Electroweak review by 
Erler and Freitas in (Patrignani, 2016). 


Input Parameters 


The basic input parameters relevant to WNC, WCC, and Z/W properties in the SM are: 
(a) the SU(2) x U(1) gauge couplings g and g'; (b) the weak scale, v = V/2(0|y°|0) ~ 246 
GeV, that characterizes the SSB; (c) the Higgs mass My defined in (8.41) (or, equivalently, 
the quartic coupling A), which enters the radiative corrections; (c) the heavy fermion masses 
Mt, Mp, etc., which are needed in radiative corrections and phase space factors; and (d) the 
strong fine structure constant a,, which enters radiative corrections. Especially important 
are g, g', and v, which are needed for weak amplitudes and mass formulae at the tree level. 
However, it is convenient to trade them for other quantities that are precisely known and 
more directly measured, and to express the theoretical expressions for other observables in 
terms of them. A conventional choice is: 


The fine structure constant, a = 1/137.035999139(31) (Mohr et al., 2012), determined 
from the anomalous magnetic moment of the electron and other low energy QED 
tests, as described in Section 2.12.3. It is related by e = g sin Ow (8.69). However, the 
running a must be extrapolated to Q? = M2, as described in Section 2.12.2. With 
the conventional (on-shell) QED renormalization scheme, 


a (M2) = 


1— Aa(M2 
1= Aa(Mz) _ 198 997(17), (8.137) 
Q 


where the uncertainty is mainly from the hadronic contribution Aaf), = 0.02764(13) 
to Aa (for as(M2) = 0.118(2)). In the MS scheme one expects @~1(M3?) ~ 
127.950(17). Even these tiny uncertainties dominate the theoretical uncertainty in 
the precision program. 


The Fermi constant, Gr = 1/./2v? = 1.1663787(6) x 1075 GeV, defined in terms of 
the muon lifetime using (7.51). 
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The third parameter was traditionally sin? 0w, defined at tree level by g’?/(g? + g?) 
and determined by WNC processes or properties of the Z. However, following the 
ultra-precise determination of Mz = 91.1876(21) GeV at LEP, it has become 
more common to view Mz as the third input parameter. At tree level Mz = 
ev /2 sin Ow cos Ow by (8.34). 


Definitions of the Renormalized sin? 0w 


As discussed in Section 2.12.1, higher-order corrections to amplitudes and other observables 
are usually divergent. In renormalizable theories the divergences can all be absorbed into 
renormalized (physical) charges, masses, and wave function renormalizations. To actually 
carry out the renormalization one must first regularize the divergent integrals in a way that 
preserves the symmetries, such as Pauli-Villars, lattice, or dimensional regularization, or to 
avoid the divergences in the BPHZ scheme (see, e.g., Collins, 1986). It is also necessary to 
define the renormalized quantities. Two popular definitions are the on-shell and the modified 
minimal subtraction (MS) schemes. The on-shell scheme was illustrated in Section 2.12.1, 
where the renormalized electron mass m was defined as the position of the propagator pole, 
which coincides with the kinematic meaning of mass. The renormalized electric charge is 
defined in terms of the electron-photon vertex when all of the external particles are on shell, 
ps = p? = m? and q? = (p2 — pi)? = 0. An alternative is the MS scheme (see, e.g., Peskin 
and Schroeder, 1995). One can define the renormalized charge é(,1) at scale u as eg—de, where 
eo is the bare charge and the counterterm ðe is chosen to absorb the (n — 4)~! poles that 
arise in the dimensional regularization of higher-order corrections, as well as some associated 
constants, ln 4r — yg, where yg ~ 0.5772 is the Euler-Mascheroni constant. Renormalized 
masses are defined similarly. The MS schemes are simple computationally and minimize 
some higher-order effects, and they lead automatically to running (~-dependent) couplings 
and masses. However, they are not as directly related to what is actually measured as 
the on-shell quantities. Of course, one can write theoretical expressions for all observables 
in terms of either set of renormalized quantities, and there are a-dependent translations 
between them. 

Now, let us consider the renormalization of sin? 0w in the SU(2) x U(1) theory (for 
reviews, see W. Hollik in Langacker, 1995; Sirlin and Ferroglia, 2013; Patrignani, 2016). In 
Section 8.2 we encountered a number of expressions for sin? Oy, all of which are equivalent 
at tree-level. These included 


sin? Ow =1- Mw (on-shell) sin? Oy cos? Oy = ——-—— (Z — mass) 
M3 , V2G rp M2 
12 
1 
sin? Ow = —2—, (MS), gf, =—= +2 sin? Oy (effective), (8.138) 
g +g? 2 


where gf, is the vector Zle coupling defined in (8.75). Each of the four expressions in (8.138) 
can be the basis of the definition of the renormalized sin? 0w, which will be denoted Sirs 
Shy, ŝ%, and 87, respectively, and in each case all observables can be expressed in terms 
of the renormalized sin? Oy as well as mų, My, as, etc. The four schemes are related by 
calculable corrections of O(a), which can, however, depend on m; and My. Each has its 
advantages and disadvantages. 

The on-shell scheme (Sirlin, 1980) defines sj, = 1 — Mj,/M? to all orders in perturba- 
tion theory, where My,z are the physical (pole masses). It is therefore based more on the 
spontaneous symmetry breaking (SSB) mechanism than on the WNC vertices. It is the most 
commonly used scheme, is simple conceptually, and is used in the program ZFITTER (Ar- 
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buzov et al., 2006). However, observables!’ involving the Zff vertex receive somewhat 
artificial m, and My dependence, and the dependence of Mz on m; is also somewhat en- 
hanced, so the experimental value s?y = 0.22336(10) has a larger uncertainty than the other 
schemes. Other drawbacks are that the mixed QCD-EW radiative corrections are large in 
the scheme, and it becomes awkward in the presence of any “beyond the standard model” 
physics that affects Mz or My. 

The Z-mass scheme is an alternative on-shell scheme (Novikov et al., 1993), which 
essentially uses the tree-level relation between Mz and Si, except the value of the running 
a( M3) is used. It is simple conceptually, and the value s5,,, = 0.23105(5) extracted from the 
measured Z mass has the smallest uncertainty because there is no m; or My dependence in 
the relation. (The uncertainty is mainly from a(M?#).) However, the m; and My dependence 
and their resultant uncertainties reemerge when other observables are expressed in terms 
of SMr The other difficulties of the on-shell scheme are shared. 

The other schemes considered here are based on the coupling constants and therefore 
the Z vertices more than on the SSB mechanism. The MS scheme defines a running 


DUS az (8.139) 


where g and ĝ' are defined by modified minimal subtraction (Marciano and Sirlin, 1981). 
The scheme, which is used in the radiative correction program GAPP (Erler, 1999), is 
simple theoretically, is convenient for comparison of running couplings with grand unification 
theories, and is closely connected with the scheme used in most QCD calculations, thereby 
minimizing the uncertainties in the mixed QCD-EW radiative corrections. Other advantages 
are that the value of §z is usually insensitive to new physics, the Z asymmetries are almost 
independent of m; and My for fixed z (except for Z > bb, which involves special vertex 
corrections involving the t), and the sensitivity of the value obtained from Mz is reduced 
compared to the on-shell value, leading to a smaller uncertainty in the experimental value, 
82, = 87}, (M2) = 0.23129(5). The running of 8% (u?) can be tested by determinations based 
on low energy WNC experiments, as shown in Figure 8.14. The agreement is generally good, 
though the deviation of the NuTeV deep inelastic measurement (Section 8.3.1) is evident. 
It is sometimes useful to define 3% = 83,(0), predicted to be 0.23865(8), for comparison with 
the present and future (more precise) low energy experiments (Kumar et al., 2013; Erler 
et al., 2014). The drawback of 5%, is that it is a “theorist’s” definition. All observables can 
be expressed in terms of it, but there is no simple defining relation. Just as in the on-shell 
scheme, the experimental value of 83, is generally obtained by a global fit to all of the data. 

The effective scheme, which has been used very successfully in expressing the results of 
the Z-pole experiments, defines an effective 5 for each type of fermion f in such a way 
that the formulae for the Z — ff partial widths and asymmetries take their tree level form 
when written in terms of effective axial and vector couplings 


94 = Pitt, Ib = VPs (te — 28745) (8.140) 


where pf is common to the vector and axial couplings. This form is very simple conceptually 
and has the advantage that the Z — f f asymmetries are almost independent of m; and My 
(except for f = b). The most precise is the charged lepton effective coupling, 57 = 0.23152(5), 
which is closely related to 8%. Drawbacks are that it is a phenomenological definition, which 


17It should be emhasized that even in the on-shell scheme the most precise experimental value of se, is 
obtained from a global fit of sty and the other parameters to all of the precision observables, and not just 
the value obtained from the defining relation. 
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in detail depends on how all of the radiative corrections to the processes observed near the 
Z-pole have been applied. Also, 5 is different for each f, and the scheme is not very useful 
for describing non-Z-pole observables. 


0,0001 0,001 0,01 0,1 1 10 100 1000 10000 
u [GeV] 


Figure 8.14 Running ô% (u°) measured at various scales, compared with the predic- 
tions of the SM, updated from (Czarnecki and Marciano, 2000; Erler and Ramsey- 
Musolf, 2005). eDIS refers to deep inelastic eD scattering. The Tevatron and LHC 
points are at u = Mz but are displaced for clarity. The discontinuities in slope, 
which are due to particle thresholds, can be smoothed out in a varient scheme 
defined in (Czarnecki and Marciano, 2000). Plot courtesy of the Particle Data 
Group (Patrignani, 2016). 


Radiative Corrections 


There are several classes of radiative corrections that must be applied to electroweak quan- 
tities. The first are the reduced QED corrections consisting of diagrams in which real and 
virtual photons are attached to charged particles in all possible ways, but not including 
vacuum polarization (self-energy) diagrams, as illustrated in Figure 8.15. For Z exchange 
processes this set is often finite and gauge invariant. However, these diagrams depend on the 


Figure 8.15 Typical reduced QED corrections to Z exchange. Vacuum polarization 
diagrams are not included. 
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energies, acceptances, and cuts, and need to be calculated and applied for each individual 
experiment. 

Gauge self-energy (oblique) diagrams for the yy, YZ, ZZ, and WW propagators involving 
fermion, gauge, and Higgs loops, as illustrated in Figure 8.16, are the most important. They 
lead to large m, and My dependence of Mw, Mz, partial widths, and four-fermi amplitudes, 
and to the major differences between the renormalization schemes. The dominant effects 
are quadratic in m and logarithmic in My, i.e., of O(am?) and O(aln My). One-loop 
self-energy diagrams sensitive to the top and Higgs masses are shown in Figure 8.17. These 
were significant in constraining the values of m; and My from the precision data. 
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Figure 8.16 Typical one-loop gauge self-energy diagrams, also known as vacuum po- 
larization or oblique diagrams. 
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Figure 8.18 Electroweak vertex and box diagrams, involving W and Z bosons. 


Electroweak vertex and box diagrams include those in which a W or Z is exchanged 
across a vertex, or box diagrams involving WW, W Z, or ZZ, as illustrated in Figure 8.18. 
These are usually small. However, they are needed for gauge invariance and are not always 
negligible, e.g., in effective four-fermi operators. They are usually absorbed into the param- 
eters in the effective interactions. Especially important are corrections to the Zbb vertex 
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involving the t quark, shown in Figure 8.19. Like the oblique corrections, these scale as am? 
for large m, and therefore make a significant contribution to Z — bb decay, especially the 
width. The Zbb vertex corrections were especially useful for distinguishing the effects of m; 
from the Higgs mass and from possible new physics contributions to the oblique corrections. 


Figure 8.19 One-loop contributions to the Zbb vertex involving the t. 


QCD and mixed QCD-electroweak diagrams involve gluon exchange between quarks in 
electroweak processes, as illustrated in Figure 8.20. They affect the hadronic W and Z decay 
widths at one-loop level, and can therefore be used to determine a,(M#) from the Z decays. 
They also affect the oblique corrections at two-loop level, where the dominant effects are of 
order aa,m?. 


x 
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Figure 8.20 QCD corrections to the Wqq or Zqq vertices and mixed corrections to 
an electroweak self-energy diagram. 
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My, Mz, and WNC Amplitudes at Higher Order. 


The tree-level expressions for Mw,z in (8.34) can be rewritten in terms of the independent 
variables a, Gyr, and sin? Oy as 
Ao Mw 


Mw = Mz = —— 8.141 
W sinyy’ 2 cos Oy’ ( ) 


where Ao = (1a/V2Gr)'/. All of the quantities in (8.141), i.e., Mw,z, a, Gp, and sin? Oy, 
are unrenormalized, though we do not indicate that explicitly for notational simplicity. 
Including radiative corrections, (8.141) is modified in the MS renormalization scheme to 


A M 
0 Mz w 


se eel ae 8.142 
3z(1 — Afw)? pêz’ eL 
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where 8? is the MS weak angle, ¢2 = 1 — 8%, Ap = (na/V2Gr)\/? = 37.28039(1) GeV 
using the physical values for a and Gr, and Afw collects the radiative corrections to My. 
(For comparison with Z-pole physics it is convenient to define Afw so that the running 87 
is evaluated at Mz.) The largest contribution to Afw is due to the running of a~! to the 
value @~1(M?) ~ 128 at the electroweak scale, and yields 


Afw ~ Arg =1- ~ 0.06630(13). (8.143) 


a 
â( M3) 
Including smaller effects, one expects Afw ~ 0.06952(13) in the SM. The large m — m, mass 
difference breaks SU (2) symmetry and leads, via the oblique corrections from the diagrams 
in Figure 8.17, to an increase that depends quadratically on m in the ratio Mw /Mz 
compared to the bare ratio. The quadratic dependence does not enter Afw because the shift 
in Mw has been absorbed into the observed strength of the WCC, i.e., into the measured 
Gr. Rather, it is shifted into the parameter ô in (8.142). There are other contributions to 


p, but the largest is from mz, 
prl+pi, (8.144) 


where the ubiquitous quantity p+ is (Veltman, 1977; Chanowitz et al., 1978) 


3Grm? Mt 2 
= ~ 0.00940 (a) 8.145 
Pe Bon? 173.34 GeV alae) 
In the on-shell scheme (8.141) becomes 
Ag Mw 
M Mz = — 8.146 
W sw(1 _ Ar)!/2’ Z cw , ( ) 


where sj, = 1 — M3, /Mż is the definition of the renormalized angle, cjy = 1 — sj, and Ao 
is the same as in the on-shell scheme. Ar collects all of the radiative corrections relating a, 
a(Mz), Gr, Mw, and Mz. The largest contributions to Ar are 


Ar ~ Arg — 2 (8.147) 
Ww 

where tw = sw/cw. Aro is due to the running of a~! and is the same as in the MS scheme. 
The p; dependence of Ar is enhanced by the tae in the on-shell scheme, due to the fact that 
the physical m; effect of increasing My /Mz forces a downward shift in Sty which must be 
compensated by Ar. There are additional contributions to Ar from bosonic loops, including 
those which depend logarithmically on My. Altogether, one expects Ar ~ 0.03648(31) in 
the SM. 

My has been measured separately at the Tevatron and at LEP 2 at the 0.04% level using 
event shapes and also the behavior of the e~et — W- W+ cross section near threshold. The 
results agree and can be combined to obtain My = 80.385(15) GeV, in agreement with the 
SM expectation of 80.361(6) GeV using the global best fit parameters. This is about 3.7% 
higher than the value of ~ 78 GeV that one would expect (for the same z) at tree-level, 
therefore confirming the radiative corrections, especially the running a. The value of Mz 
will be discussed in Section 8.3.5. 

The tree-level amplitude A? for a WNC process i is of the form 


A? = —£ F, (sin? 0w ), (8.148) 
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where F; is a function of i, such as defined in (8.79), and GY, and sin? Ow are the unrenor- 
malized quantities. The effect of higher-order corrections is to replace A? by the amplitude 


A; = Pay F(R) ro (8.149) 


where Gp is the renormalized Fermi constant defined by (7.51) on page 234, F; takes the 
same functional form as in (8.148), and s% is the renormalized quantity in scheme R. The 
three parameters pas kh, and re depend on the process and on the renormalization scheme, 
and one expects ph — 1, kå — 1, and A‘, to be of O(a). They are defined operationally, i.e., 
by the way that they modify the tree-amplitude (for an example, see Marciano and Sirlin, 
1980). Oblique corrections directly affect pi, but may also enter xi, through their effects 
on sj in the on-shell scheme. Vertex corrections typically affect pj, and «kh, while box 
diagrams may yield contributions to Ak. The Ap are usually small enough to be neglected. 
In the MS scheme one may write 


P = Pihoas (8.150) 


where ô is a universal part due to the shift of MŽ in (8.142). @ contains the quadratic me 
dependence in (8.144) and the dominant My dependence. fi „q and &’ are close to unity and 
(like Afw) depend only weakly on m, and My (with the exception of processes involving 
the Zbb vertex). In the on-shell scheme both pê and «ê have universal terms containing 
the quadratic m; and the dominant My dependence, and much weaker process-dependent 
contributions. The leading m;-dependent term is 


pi~ltpe, kê ~ 1+ p/tey, (8.151) 


where (similar to Ar) the «’ dependence is present to cancel the m, dependence of the 
vertices introduced by the definitions of s?y. 


The W Decay Width 


Consider the coupling g 
L=-fiy" (gv — 947°) fava + h.c., (8.152) 


of a massive vector V to fermions fı,2. (If V is Hermitian and fı = f2 then gy,4 are real 
and there is no +h.c.) The partial width for V > fı f2 is easily shown to be (Problem 2.25) 


r(V > fif) = iv (lav? + |gal?) (8.153) 


at tree-level and neglecting the fermion masses. Applying this result to leptonic W decays 
using (8.55) and (8.60), this implies 


2M M 
iw > etve) = ZV =- -2Y (8.154) 
48T 12 sin^ Ow 
and similarly for utv, and r™v,. It is convenient to rewrite (8.154) as 
GrM} 
(Wt > etve) = ZEW w 226.27(5) MeV (8.155) 


6/27 
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using g? My /4V2 = Gr Mj,. This form absorbs the dominant radiative corrections, includ- 
ing the running of a. The numerical values for the partial widths include the small residual 
electroweak corrections and fermion mass effect. Similarly, the hadronic partial widths are 


_ CGrM3, 


T(W* > uidi) = ———" |V,, |? = 705.1(3) |V|? MeV, 8.156 
( uidj) oyan | (3) |Vizl ( ) 
where V;; is the CKM matrix element, and 
z M2 2 3 4 
C=3 (1 y os(My) 1.40958 — 12.7758 80.0%) (8.157) 
T T vii Te 


includes the color counting factor and the four-loop QCD vertex correction for massless 
quarks. Again, there are small QED and electroweak corrections and one can include fermion 
mass effects. Adding the contributions, one predicts rw ~ 2.0888(7) GeV, in agreement 
with the experimental world average Tw = 2.085(42) GeV, which is based largely on decay 
distributions at LEP (Schael et al., 2013) and the Tevatron (Bandurin et al., 2015). 


8.3.5 The Z-Pole and Above 


As can be seen in Figure 8.21 the cross section for ee~ annihilation is greatly enhanced 
near the Z-pole. This allowed high statistics studies of the properties of the Z at LEP 
(CERN) and SLC (SLAC) in e~et + Z > 44+, qq, and vv (for reviews, see the articles 
by D. Schaile and by A. Blondel in Langacker, 1995; Grunewald, 1999; Schael et al., 2006a). 
The four experiments ALEPH, DELPHI, L3, and OPAL at LEP collected some 1.7 x 107 
events at or near the Z-pole during the period 1989-1995. The SLD collaboration at the 
SLC observed some 6 x 10° events during 1992-1998, with the lower statistics compensated 
by a highly polarized e~ beam with P.- = 75%. 
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Figure 8.21 Cross section for ete~ — hadrons as a function of CM energy. At low 
energies the cross section is dominated by 1 photon exchange in the s channel, while 
at high energies the Z dominates. The approximate energy ranges of various et e~ 
colliders are also shown. Reprinted with permission from (Schael et al., 2006a). 
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The basic Z-pole observables relevant to the precision program are: 
The lineshape variables Mz, Tz, and Gpeak (Appendix F). 


The branching ratios for Z to decay into e~e+, “put, or Tr"; into qq, cé, or bb; 
or into invisible channels such as v7 (allowing a determination of the number N, = 
2.992 + 0.007 of neutrinos lighter than Mz/2). 


Various asymmetries, including forward-backward (FB), hadronic FB charge, polar- 
ization (LR), mixed FB-LR, and the polarization of produced 7’s. 


The branching ratios and FB asymmetries could be measured separately for e, u, and T, 
allowing tests of lepton family universality. 

LEP and SLC simultaneously carried out other programs, most notably studies and tests 
of QCD, and heavy quark physics. 


The Z Lineshape 


One of the most important observables is the Z lineshape, i.e., the cross section for eet > 
ff, where f = e, ,7, b,c, or hadrons, as a function s = E? m near the Z-pole. The expected 
Breit-Wigner form! (slightly more sophisticated than the one in (F.2)), is 


(8.158) 


sT? 
of(s) vo Z s2T3 9 


2 
(s — M2) + ae 


where of is the peak cross section and Iz is the total Z width. By measuring the cross 
section at a number of energies near the peak one can determine Mz, Iz, and ay, as 
illustrated in Figure F.1. For example, the cross section Chaa for e~e* — hadrons is shown 
in Figure 8.21. The results for the Z are generally expressed in a model-independent way 
in terms of the partial widths (ff) for Z > ff and in terms of effective couplings to be 
defined below. From Problem 2.27 the expression for the peak cross section is 


120 (eet )P(ff) 
of = MZ rZ . 


(8.159) 


Using (8.72) and (8.153), the width for Z — ff at tree-level is 


a _ Cr(g?+97)Mz (fo, or CraMz f2 f2 
TD gee (+98) = yan oy otag HA) 6100) 


where Cp = 1 (leptons) and 3 (quarks) is the color factor, g/, , are defined in (8.75), and 
f VA 


we have neglected the fermion masses. Including radiative corrections, I (f f) becomes 


n _ Cra(MZ)Mz /_f2 f2 CyGrM3 . (72 f2 
A = Se (af L er a (af +5), (8.161) 


Sie Cyr 
where for quarks Cy now includes QCD corrections similar to (8.157) evaluated at M? and 
a, a are effective vertex factors 


Gi, = Sot, ab = bs (Èr — 2h p9582) , (8.162) 


18There are non-negligible corrections from reduced QED diagrams, including initial state photon radi- 
ation and s-channel photon exchange, as well as y — Z interference. These are taken into account, usually 
assuming the SM expressions, in determining the Z parameters. 
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which are the same as the tree-level expressions in (8.76) except for the electroweak radiative 
correction factors ôs and Ås. Note the similarity of this parametrization to the one in (8.149) 
for WNC amplitudes. The second form in (8.161) is obtained using (8.142), and contains the 
quadratic m, dependence of (8.144) as well as the largest My dependence in the universal 
factor p. Many types of corrections enter into pf and Åp, but the overall corrections are 
small and only weakly sensitive to m, and My. (The fy are analogous to the ĝi q of 
(8.150).) For example, one expects je ~ 0.9980 and Êg ~ 1.0010. The one exception is f = b 
because of the t contributions to the vertex in Figure 8.19. For the known m, one has the 
rather large corrections py ~ 0.9868, Ra ~ 1.0065. In practice, (8.161) must be extended 
to include fermion mass effects (including those within the radiative corrections), two-loop 
QED corrections, two-loop mixed QED-QCD corrections, etc. The partial width can also 
be written in the on-shell scheme as 


mr , Cr@rMz (_f2 f2 
(F~ a (ai: + gh”) (8.163) 
with 
a= VPFË L» at, = Vor (Èr — 2npapsy) - (8.164) 


We have used the same notation for a a in both schemes to avoid a proliferation of symbols, 
but they differ in detail, e.g., there is no analog of ô in (8.163), with the effects now absorbed 
into the pr. The dominant m, dependence is pf ~ 1+ pr, Kf ~ 1+ pr/thy (cf. (8.151). 

It is convenient to define an effective angle 54 


54 = Kp Sty = Ry SZ. (8.165) 


fp is insensitive to m; and My, so one has 52 ~ 87 + 0.00023. The effective couplings 
can then be written as in (8.140), i.e., they take the form of tree-level expressions up to 
an overall factor of ,/py, where py depends on the renormalization scheme. The Z-pole 
analyses often extract the aL a from the data in a model-independent way. 

Using (8.161) or (8.163) and additional corrections, the SM predictions are 


f 299.91 +0.19 MeV (ut), 167.17 + 0.02 MeV (vv) 


(ff) ~ 4 382.80 +0.14 MeV (dd), 83.97 + 0.01 MeV (ete7) (8.166) 
375.69 = 0.17 MeV (bb) 


for wa, = 0.1182(16) and the observed m; and My. The small partial width into ete is 
due to the small value of gf, which would vanish for 52 = 1/4. The total width is predicted 
to be Tz ~ 2.4943(8) GeV, compared with the experimental value 2.4952(23). 

The results of the four LEP experiments have been carefully combined by the LEP 
Electroweak Working Group (LEPEWWG) (Schael et al., 2006a, LEPEWWG website), 
which took into account common systematics, corrected the data for non-Z effects, applied 
radiative corrections, and carried out SM and model-independent analyses of the Z-pole and 
other data, in collaboration with SLC, Tevatron and other LEP working groups. For the 
Z lineshape, results were typically presented in terms of a conventional set of observables: 
Mz, Tz, Ohad; Re, Ro, and Ro; where 


127 T(ete~)I'(Z > hadrons) 


ad = 8.167 
Ohad = MZ rZ ey) 
is the peak cross section into hadrons, and 
Ci) (had) 
R, = i = b, 5 Ry = —~ 5 li = >t, 8.168 
qi T(had)’ q ( c) Li T (l;t) (e H T) ( ) 
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are ratios of partial widths to the total width into hadrons, in which many of the radiative 
corrections and uncertainties cancel. This set had the benefit of being weakly correlated, 
though the precision still required careful treatment of the full error correlation matrices. 

The principal Z-pole observations from LEP and SLC are listed in Table 8.5, along with 
the standard model expectations using the global best fit values for the input parameters. 
The precision of the Mz determination, to ~0.0023%, is unprecedented in this high energy 
regime. The energy calibration was done using a resonant depolarization technique, and 
corrections had to be made for the tidal effects of the Sun and Moon, the water table and 
the water level in Lake Geneva, and leakage currents produced by nearby trains! Many of 
the other quantities are measured to the 0.1-1% level, and the agreement with the predicted 
values from the fit is generally quite good, though there are one or two exceptions. 


TABLE 8.5 Principal Z-pole observables, their experimental values, theoretical predictions 
using the SM parameters from the global best fit, and pull (difference from the prediction 
divided by the uncertainty).* 


Quantity Value Standard Model Pull 
Mz [GeV] 91.1876 + 0.0021 91.1880 + 0.0020 —0.2 
Tz [GeV] 2.4952 + 0.0023 2.4943 + 0.0008 0.4 
T (had) [GeV] 1.7444 + 0.0020 1.7420 + 0.0008 = 
T (inv) [MeV] 499.0 + 1.5 501.66 + 0.05 = 
T(é+é-) [MeV] 83.984 + 0.086 83.995 + 0.010 — 
Shaa [nb] 41.541 0.037 41.484 + 0.008 1.5 
Re 20.804 + 0.050 20.734 + 0.010 1.4 
Ru 20.785 + 0.033 1.6 
Ry 20.764 + 0.045 20.779 + 0.010 —0.3 
Rs 0.21629 + 0.00066 0.21579 + 0.00003 0.8 
Re 0.1721 + 0.0030 0.17221 + 0.00003 0.0 
ASS 0.0145 + 0.0025 0.01622 + 0.00009 —0.7 
Ahh 0.0169 + 0.0013 0.5 
ARD 0.0188 + 0.0017 1.5 
A% 0.0992 + 0.0016 0.1031 0.0003 —2.4 
ASS 0.0707 0.0035 0.0736 + 0.0002 —0.8 
AS 0.098 + 0.011 0.1032 + 0.0003 -0.5 
33(A:4) (LEP) 0.2324 + 0.0012 0.23152 + 0.00005 0.7 
52(A%6) (Tevatron) 0.23185 + 0.00035 0.9 
52(A%6) (LHC) 0.23105 + 0.00087 —0.5 
Ae (hadronic) 0.15138 + 0.00216 0.1470 + 0.0004 2.0 

(leptonic) 0.1544 + 0.0060 1.2 
(P,) 0.1498 + 0.0049 0.6 
Au 0.142 + 0.015 -0.3 
A, (SLD) 0.136 + 0.015 —0.7 
(P,) 0.1439 + 0.0043 —0.7 
A 0.923 + 0.020 0.9347 —0.6 
Ac 0.670 + 0.027 0.6678 + 0.0002 0.1 
A; 0.895 + 0.091 0.9356 —0.4 


“From the Electroweak review in (Patrignani, 2016). (had), T (inv), and T'(¢+£—) are not independent. 
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The individual lepton channels could be measured separately, allowing the lepton uni- 
versality prediction Re = R, ~ R, (up to a small r mass correction) to be tested. The data 
are in agreement with universality within the uncertainties, and can therefore be combined 
to form an average lepton ratio Re. The LEP values of Rọ are shown in Figure 8.22 and 
compared with the predictions of the SM as a function of the Higgs mass My, obtained 
using the then known values for Mz, m4, and ag. 

Other quantities can be derived from the conventional set, such as the hadronic and 
leptonic widths, T (had) and I'(¢2). Especially important is the invisible Z width determined 
in principle by subtracting the hadronic and charged lepton widths from the total (lineshape) 
one, 


[(inv) = Pz —T(had) — x T (4t) (8.169) 


In the SM T (inv) is due to the (unobserved) neutrinos. It can be predicted from the other 
observables provided one knows the number N,, of neutrino flavors, implying that the line- 
shape observables are not all independent for a given N,. The hadronic cross section is 
shown as a function of ys in Figure 8.23 compared with the SM prediction for 2, 3, and 4 
flavors of light neutrinos. It is seen that the data are only consistent with N, = 3. One can 
quantify this by allowing N, = 3+ AN, to be a free parameter, with 


T(inv) = (3+ AN,)I(v0), (8.170) 


where ['(v7) is the theoretical width in (8.166) for a single neutrino flavor. One obtains!? 
N, = 2.992 + 0.007 or AN, = —0.008(7). Thus, a fourth family with a light neutrino is 
clearly excluded. (The exclusion is valid for m,, almost as large as Mz/2. However, it does 
not apply to right-handed (sterile) neutrinos, which do not couple directly to the Z.) The 
result for (inv) is also important for constraining other types of possible extensions to 
the SM, because it would receive contributions from any decays into unobserved particles. 
This is conveniently parametrized by (8.170), even when AN, has nothing to do with 
extra neutrinos. For example, AN, would be 4 for a single flavor of light scalar neutrino 
(Z — VE) in supersymmetry, while AN, = 2 sould be expected in triplet Majoron neutrino 
mass iiodels with spontaneous L violation, from the decay of the Z into a pseudo-Goldstone 
boson (Majoron) and a light scalar (Chapter 9). 


Z-Pole Asymmetries 


Most of the Z-pole asymmetries?’ can be expressed in terms of the quantities 
P y 


ozta 

Ay = "4, (8.171) 
Iv +94 

where i. a are the effective couplings defined in (8.140). The asymmetries are easily calcu- 


lated (ignoring the fermion masses) using (8.127) and (8.128) and the effective coupling 


fee "(a = J4) Zu (8.172) 
19This value was obtained from the global fit including all correlations. The definition used by the LEP 
collaborations, Ny = Henn (FEB) ar , where T (24) is measured and the last factor is the theoretical ratio 


in the SM, yields N, = 2.984(8). 

20The Born asymmetries A? considered here are the idealized asymmetries obtained after subtracting 
reduced QED and off-pole effects from the data, and dividing by the beam polarization P,- for the SLC 
results. 
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Forward-Backward Pole Asymmetry 
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Figure 8.22 Re and A from the four LEP experiments, compared with the SM 
predictions as a function of My using the observed Mz. Both results are consistent 
with My = 125 GeV. Reprinted with permission from (Schael et al., 2006a, with 


LP05/EPS05 updates). 
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Figure 8.23 Lineshape for Z —> hadrons, compared with the expectation for 2, 3, and 
4 ordinary neutrinos lighter than ~ Mz/2. The asymmetry is due to the explicit 
factor of s in the numerator of the Breit-Wigner formula (cf. Equation F.7) and to 
radiative corrections. Reprinted with permission from (Schael et al., 2006a). 
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For example, the forward-backward asymmetry A% p for eet > f f at the Z-pole, defined 
by (8.125) just as in the low-energy case, is 


3 
Af = qAeAr. (8.173) 


The LEP experiments measured Aas for f = e, 4,7, b,c, and (less precisely) s. These and 
other asymmetries are listed in Table 8.5. The individual measurements of A%,, Ay B and 
A, are in reasonable agreement with lepton universality, allowing them to be combined 
into an average A%p. The LEP experiments also measured a forward-backward asymmetry 
between positive and negative charge in hadronic Z events. This is more complicated to 
interpret because it depends on the fragmentation of the final quarks, but nevertheless 
leads to a measurements of the effective leptonic angle that is denoted (APs) in Table 
8.5. 

Forward-backward asymmetries in pp — 447, £ = e or u, have been measured by CDF 
and DO at the Tevatron, and in pp > 4+4- by ATLAS, CMS, and LHCb at the LHC.?! 
These (Drell-Yan) processes are mediated by y and Z exchange, and for ¿+47 masses in the 
vicinity of Mz they lead to high precision determinations of sin? 6y comparable to those of 
the Z-pole experiments (e.g., Patrignani, 2016), as shown in Table 8.5. 

As discussed following (7.53), the polarization of a rF in e~et + 7~7* can be mea- 
sured from the angular distribution of the 7 decay products in the 7 rest frame. The 77 
polarization, averaged over the 7~ CM angle 0, is 


p? = a(h,- = +5 
J o(h,- = +4) + o( g= = —4) 


=-A,, (8.174) 


which allows the extraction of A,. Similarly, the polarization of t~ produced at a definite 
angle 0 is 
A, af A 2z 


ie (8.175) 


14+ A4,Ac 2s’ 


Pz) = 


where z = cos0, allowing Ae to be determined from the angular distribution. 
The SLC had a longitudinally polarized e~ beam, with a reversible polarization of 
= 75%. This allowed them to measure the polarization asymmetry 


o(h.- = — 
o(he- = — 


) m a(he- SGR 


A= 
ne ) + o(he= ae 


= Ae, (8.176) 


which projects out the electron couplings. The measurements were done separately for 
hadronic and leptonic final states. They also measured mixed forward-backward polarization 
asymmetries for various final states, 


f f f f 
Orp — o Onp to 3 
ASEB LF LB RE RB Ay (8.177) 


ofp tolp + ohr +ohp 4 


in an obvious notation, allowing them to determine the final fermion couplings. 
The LEP and SLD lineshape and asymmetry results can be combined to determine the 
: ; _f ‘ ; TRN, 
effective couplings gy 4. The results are displayed in Figure 8.24. 


?1Forward-backward asymmetries are possible in pp reactions because of the difference between the PDFs 
for quarks and antiquarks at nonzero rapidity. 
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Figure 8.24 Allowed regions in a vs DA at lo (39.4% c.l.) for f = e, u, T, compared 


with the SM prediction as a function of 8%. The 90% c.l. region for Iv, A assuming 
lepton universality, and the global best fit value 53 = 0.23129 are also indicated. 
Plot courtesy of the Particle Data Group (Patrignani, 2016). 


Af involves the ratios of effective couplings, so py cancels. Ay and the asymmetries 
are therefore insensitive to m, and My for a fixed 5? or 8%, allowing the weak angle to 
be extracted with little ambiguity. (Similar statements apply to Mw and to Re and Ry in 
(8.168), although some m; dependence enters the ratios through the Zbb vertices.) However, 
Mz and the other lineshape variables do depend on them through /, so the m, and My 
constraints come mainly from comparing Mz with the asymmetries and other observables, 
or, equivalently, by expressing the other quantities in terms of Mz rather than 57. As an 
example, the observed AGE as well as Rọ are compared to the SM expectation as a function 
of My in Figure 8.22, where the SM predictions utilize the observed Mz. These observables 
favored relatively low values for the Higgs mass (prior to its discovery), but with large 
uncertainty due to the logarithmic dependence. 

There is some tension between the leptonic and hadronic asymmetries, as can be seen 
in Figure 8.25. The most precise determinations of 57 are from the polarization asymmetry 
Arr and from the b FB asymmetry Ap. Arr is proportional to gf, ~ —4 + 237, and is 
therefore extremely sensitive to 52. On the other hand, the leptonic FB asymmetry AQ, is 
proportional to gf? and is therefore less sensitive since gf, is small. A9?, = SA Ay does not 
have this difficulty, and is much more sensitive to the e vertex than the b vertex assuming 
the SM. As can be seen from Table 8.5 and Figure 8.25, there is a possible discrepancy 
of AY’, from the value predicted from the best fit, and the corresponding 5? is about 2.40 
higher than the average. Similarly, the 5? from Azp is ~ 2c lower. This could well be due to 
fluctuations, but the other (less precise) hadronic and leptonic asymmetries tend to follow 
the same pattern. The hadronic asymmetries favor a larger Higgs mass, while the leptonic 
ones (and My) favor a small My, so the average value relies on a delicate balance. As can 
be seen in Figure 8.25 (which predated the Higgs observation), the average was consistent 
(with large uncertainties) with the value My ~ 125 GeV that was subsequently observed. 
Also, the recent precise Tevatron value 54 = 0.23185(35) fits comfortably in between. 
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Figure 8.25 57 from various leptonic and hadronic asymmetries, compared with the 
SM predictions as a function of My using the observed Mz. Reprinted with per- 
mission from (Schael et al., 2006a). 


It is possible that there is some kind of new physics that affects A9?, and Ayr, and that 
their combined implication for My was fortuitous. For example, there could be a tree-level 
effect that couples preferentially to the third family and accounts for the A%?, value (it 
would be hard for a new physics effect that only enters at loop level to lead to the needed 
4% shift). Possibilities would include a flavor off-diagonal coupling of a heavy Z’ gauge 
boson or the mixing of the b with a heavy exotic fermion. However, there would have to be 
a compensation between the L and R couplings to avoid a large change in Rp». 


Precision Tests at High Energy 


The second phase of LEP, LEP 2, ran at CERN from 1996-2000, with energies gradually 
increasing from ~ 140 to ~ 209 GeV (Schael et al., 2013). The principal electroweak results 
were precise measurements of the W mass, as well as its width and branching ratios; a 
measurement of ete” ~ WtW-, ZZ, and single W or Z, as a function of center of 
mass (CM) energy, which tests the cancellations between diagrams that is characteristic 
of a renormalizable gauge field theory, or, equivalently, probes the triple gauge vertices; 
limits on anomalous quartic gauge vertices; measurements of various cross sections and 
asymmetries for ete” — ff for f = e~,u-,7~,q,6 and c in reasonable agreement with 
SM predictions; and a stringent lower limit of 114.4 GeV on the Higgs mass. LEP 2 also 
studied heavy quark properties, tested QCD, and searched for supersymmetric and other 
exotic particles. 

The Tevatron pp collider at Fermilab ran from 1983-2011, with an ultimate CM energy of 
1.96 TeV (Bandurin et al., 2015). The CDF and D0 collaborations there discovered the top 
quark in 1995, with a mass consistent with the predictions from the precision electroweak 
and B/K physics observations; measured the t mass, decay distributions, tt cross sections 
and distributions, and single t production (relevant for the CKM element V;,); and carried 
out an extensive program of electroweak physics. This included measurements of the W 
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mass, width, and WF charge asymmetry (relevant to quark PDFs); 4+4- forward-backward 
asymmetries, as part of the precision program; and diboson (WW, ZZ, ZW, Zy, Wy) pro- 
duction, to constrain anomalous triple gauge couplings and test QCD predictions. The 
Tevatron collaborations also searched for the Higgs boson (and later constrained its proper- 
ties), observed B, — B, mixing and other aspects of B physics; carried out extensive QCD 
tests; and searched for heavy W’ and Z’ gauge bosons, exotic fermions, supersymmetry, 
and other types of new physics. Similar studies are being carried out at higher energy at 
the LHC (pp), which ran at CERN at 7 and 8 TeV during the first run (2010-2013), with 
results that included the discovery and study of the Higgs boson. The LHC restarted at 13 
TeV in 2015, with an eventual goal of 14 TeV. 

The HERA e*p collider at DESY observed W propagator and Z exchange effects, probed 
electroweak couplings, searched for leptoquark and other exotic interactions, and carried out 
a major program of QCD tests and structure functions studies (e.g., Diaconu et al., 2010). 


8.3.6 Implications of the Precision Program 


The principal Z-pole observables are listed in Table 8.5, and a number of important non- 
Z-pole observables in Table 8.6. The direct measurement of m+ is important for controlling 
the radiative corrections in the analysis more precisely than could be done from the indirect 
constraints alone, and in allowing sensitivity to the Higgs mass. Mw provides information 
on radiative corrections independent of the Z lineshape and asymmetries. The low energy 
WNC measurements play a critical role in constraining many types of new physics even 
though they are less precise than the Z-pole observations, because the latter are essentially 
blind to any new physics that doesn’t directly affect the Z or its couplings to fermions (such 
as heavy particle exchanges, new box-diagrams, or four-fermi operators). 


TABLE 8.6 Principal non-Z-pole observables, as of 11/15.% 


Quantity Value Standard Model Pull 
me 173.34 + 0.81 173.76 + 0.76 —0.5 
Mw (Tevatron) 80.387 + 0.016 80.361 + 0.006 1.6 
Mw (LEP 2) 80.376 + 0.033 0.4 
Tw (Tevatron) 2.046 + 0.049 2.089 + 0.001 -0.9 
Tw (LEP 2) 2.195 + 0.083 1.3 
My 125.09 + 0.24 125.11 + 0.24 0.0 
oY —0.040 + 0.015 —0.0397 + 0.0002 0.0 
g —0.507 + 0.014 —0.5064 0.0 
Qty (Maller) —0.0403 + 0.0053 —0.0473 + 0.0003 1.3 

or 0.064 + 0.012 0.0708 + 0.0003 -0.6 
Qw(Cs) —72.62 + 0.43 —73.25 + 0.02 1.5 
Qw (T) —116.4+ 3.6 —116.91 + 0.02 0.1 
8%, (eDIS) 0.2299 + 0.0043 0.23129 + 0.00005 -0.3 


“From (Patrignani, 2016). Masses are in GeV. Other non-Z-pole observables are included in the fits but 
not listed. 
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The Standard Model Fit 
As of November, 2015, the result of the global fit to the precision data yielded?” 


My = 125.11(24) GeV, m = 173.76(76) GeV (8.178) 
as(Mz) = 0.1182(16), a(MzZ)~* = 127.936(17), Aaf), = 0.02774(13) 
83 = 0.23129(5), 3% = 0.23152(5), Siy = 0.22336 (10), 


with an overall x? /df of 53.6/42. The three values of the weak angle s? refer, respectively, to 
the MS, effective Z-lepton vertex, and on-shell values, defined after Equation (8.138). The 
latter has a larger uncertainty because of a stronger dependence on the top mass. Aaf), is 
defined after (8.137). 

The value of a,(M?) in (8.178), based on Z-pole data and hadronic 7 decays, agrees 
well with the other low-energy determinations mentioned in Section 5.4. The Z-pole results 
alone yield a;(M#?) = 0.1203 + 0.0028, within lo of the global world average 0.1181(13). 
The Z-pole result has negligible theoretical uncertainty if one assumes the exact validity of 
the standard model, and is also insensitive to oblique (propagator) new physics. However, 
it is sensitive to non-universal new physics, such as those which affect the Zbb vertex. 

The precision data alone yield?’ m; = 176.7+2.1 GeV from loop corrections, in impres- 
sive agreement with the direct collider value 173.34 + 0.81 GeV. The fit actually uses the 
MS mass rh: (74), which is ~ 10 GeV lower, and is converted to the pole mass at the end. 

The electroweak precision observables depend logarithmically on the Higgs mass through 
the oblique diagrams in Figure 8.17. This My dependence is correlated with the stronger 
quadratic m; dependence, but following the direct measurement of m, it became possible 
to derive nontrivial restrictions on My. For example, the 2011 Particle Data Group fit 
predicted 68 GeV < My < 155 GeV at 90% c.l. from the indirect data, while the current 
data (excluding the direct Higgs observations) yields 66 GeV < My < 134 GeV. This is 
consistent with the direct measurement of ~ 125 GeV (but see the caveats mentioned in 
the discussion of Figure 8.25). The individual constraints can be seen in Figure 8.26. 

The My prediction is fairly robust to many types of new physics, with some exceptions. 
In particular, a larger My would have been allowed for negative new physics contributions to 
S or positive contributions to T, where S and T are the oblique parameters to be discussed 
below (see Peskin and Wells, 2001, for some specific examples), or for some Z’ models with 
Z — Z' mixing (e.g., Langacker, 2009). The predicted value would decrease if new physics 


accounted for the value of Ae?) (Chanowitz, 2002). Following the direct observation one 
can view My as a constraint on these extensions. 


22The fit results are from the Particle Data Group (Patrignani, 2016), which uses the fully MS pro- 
gram GAPP (Erler, 1999) for the radiative corrections. The results are generally in good agreement with 
those of other groups, such as those of the LEPEWWG (Schael et al., 2006a, and website) or the Gfitter 
group (Flacher et al., 2009, and website), which use the on-shell scheme. However, the PDG fits use a more 
complete set of low energy data. 

?3Prior to LEP and the SLC the WNC current experiments and early W and Z masses could set an 
upper limit on mz of O(200 GeV) (see, e.g., Amaldi et al., 1987). The more precise Z-pole experiments 
led to a prediction for mz that was correlated with Mp, which was consistent (within the rather large 
uncertainties) with the value later observed directly by CDF and DO. For example, the first measurement of 
Mz by MARK II at the SLC already implied m = 140783 GeV for My = 100 GeV, with the central value 
changing to 128 (165) GeV for My = 10 (1000) GeV (Langacker, 1989a). By the late 1980s observations 
of B — B oscillations and improved analyses of CP violation in K decays made possible by measurements 
of the relevant CKM matrix elements allowed independent lower limits on m to be set, e.g., mt > 130 
GeV (Buras, 1993). 
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The predicted My range and observed value are suggestive of (but certainly do not 
prove) the possibility that the SM is extended to a supersymmetric version at the TeV scale, 
since such theories predict a light SM-like Higgs for much of their parameter space. However, 
My = 125 GeV is on the upper end of the range allowed in the minimal version, the MSSM. 
The theoretical constraints on My in both the standard model and its supersymmetric 
extensions will be discussed in Sections 8.5 and 10.2.5. 
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Figure 8.26 1a (Ay? = 1) allowed regions in My vs m and the 90% c.l. global fit 
region from precision data, compared with the direct measurements from the LHC 
and Tevatron. Plot courtesy of the Particle Data Group (Patrignani, 2016). 


Beyond the Standard Model 


New physics modifies the SM predictions for the precision electroweak observables in sev- 
eral ways. For example, new heavy W’ or Z’ gauge bosons (Section 10.3); exotic Higgs 
particles (Accomando et al., 2006); leptoquark bosons (Barbier et al., 2005; DorSner et al., 
2016), such as occur in supersymmetric models with R-parity violation; or supersymmetric 
particles in box diagrams (Section 10.2) can lead to new four-fermi operators, which are 
especially important below or above the Z-pole (Cho et al., 1998; Cheung, 2001; Han and 
Skiba, 2005; Schael et al., 2013). The most important effects for the Z-pole experiments 
are: (a) new physics that affects the Zf f vertices, and (b) new physics that modifies the 
masses and propagators of the W and Z. The first class includes such tree-level effects 
as the mixing of the ordinary with heavy exotic fermions (Langacker and London, 1988b) 
and Z-Z’ mixing, as well as actual vertex corrections from new gauge bosons, leptoquarks, 
superpartners, etc. Their implications are model dependent and hard to parametrize in 
general, since they affect each type of chiral fermion in a different way. For this reason the 
precision constraints are usually studied for specific models or types of models. The second 
class, the oblique corrections, are described below. 
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The Oblique Parameters 


The oblique corrections are universal in that they are independent of the fermions in the 
process, and can be described by a small number of parameters. They are especially impor- 
tant for describing new particles that only or mainly affect the precision observables through 
vacuum polarization loops, such as new fermion or scalar multiplets with little or no mixing 
with or direct coupling to the ordinary particles. They can also be used to parametrize the 
effects of the t quark and Higgs on the radiative corrections (although for the top one must 
also include the Zbb vertex diagrams in Figure 8.19). However, we will follow the Particle 
Data Group (Patrignani, 2016) and focus on the new physics contributions to the oblique 
parameters, treating the m, and My effects separately. 

The pp parameter 

MZ, 
po = Zae 

extends the MS parameter ô in (8.142). The SM expression for My in (8.142) is unmodified. 
po is associated with new physics that breaks the SU(2) vector generators and therefore 
violates the custodial symmetry relation between My and Mz in (8.141) (and therefore 
between the WNC and WCC amplitudes). We assume that the new physics that leads to 
po Æ 1 is a small perturbation that does not significantly affect other radiative corrections. 
Then po can be viewed as an extra parameter that also multiplies Gr in the expressions for 
the effective WNC interaction in (8.79), the quantity yo in (8.133), and the partial Z-widths 
n (8.163). 

po # 1 can be generated by non-degenerate multiplets of heavy particles, analogous to 
the effect of the non-degenerate t and b quarks in (8.144). Non-degenerate SU(2) doublets 
of scalars and/or fermions with masses mn1,Mn2, such as a fourth family (sequential), a 
mirror family (involving right-chiral doublets and left-chiral singlets), vector pairs of quark 
or lepton doublets (with an extra factor of 2), heavy Higgs doublets not involved in symmetry 
breaking, or unmixed squark or slepton doublets in supersymmetry, yield (Veltman, 1977; 
Chanowitz et al., 1978) 


(8.179) 


= “Am 8.180 
po T aV 3 ( ) 
where Bath 
4 n 
An? = m2; +m?» Tng Jy A > (Mni — Mn)’, (8.181) 
Mp1 — Ma2 Mn2 


with Cn = 1(3) for color singlets (triplets). There is enough data to determine pp and the 
other parameters, with the global fit yielding 


po = 1.00037 + 0.00023, (8.182) 


~ 1.60 from unity, with little change in the other parameters. The po value corresponds to 
the 95% c.l. limit 


D Ain < (49 GeV)?. (8.183) 


n 


Such non-degenerate multiplets lead to pọ > 1. However, additional Higgs doublets that 
participate in SSB or heavy lepton multiplets involving Majorana neutrinos can yield pọ < 1 
(for references, see the Electroweak article in Patrignani, 2016). Tree-level effects, such as 
triplet or higher-dimensional Higgs representations with non-zero VEVs can contribute to po 


?4One must still define a fourth renormalized parameter. A convenient set is a, Gr, Mw, and Mz. 
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with either sign (Problem 8.1). Mixing of the Z with a heavy Z’ would reduce Mz, increasing 
po, while W — W” mixing would have the opposite effect. However, such mixings could also 
have other important effects on the WNC, WCC, and precision observables (Section 10.3). 
Heavy chiral fermions break the axial SU(2) generators as well as the vector ones, so their 
effects are not adequately parametrized by po alone. These can be taken into account by a 
generalization to the S, T, and U parameters (Peskin and Takeuchi, 1990, 1992). Related 
parametrizations are given in (Kennedy and Langacker, 1990; Marciano and Rosner, 1990; 
Golden and Randall, 1991; Altarelli and Barbieri, 1991). Define the vacuum polarization 
functions 
I (q) = if” Tyl?) — ag” Asa?) (8.184) 
as the one-particle irreducible propagator corrections shown in Figure 8.27. The second term 
is irrelevant for the existing precision experiments because the q” or q” always yield light 
lepton masses when contracted into the external fermion lines. We are concerned with the 


ia Ea 


Figure 8.27 Vacuum polarization bubbles for the gauge boson propagators. Examples 
are shown in Figure 8.17. 


new physics contributions IT?" (q?) to the vacuum polarizations. As long as the new physics 
is much heavier than Mz it is sufficient to assume that the IIg” (q?) are approximately linear 
from q? = 0 to MŽ. The effects on the precision observables can then be expressed in terms 
of the three parameters 


[prew Tew 


M3, Mz 
(ME) g T (M3) — m O) n 
ee = MZ l 
485 My 


Equation (8.185) assumes an MS renormalization scheme as defined in (Marciano and 
Rosner, 1990). The more general expressions are given in (Patrignani, 2016), along with 
additional details and references. These quantities have a factor of œa removed, so new 
physics contributions are expected to be of O(1). The T parameter is equivalent to po, 


~1+aT, (8.186) 


Po = Tat 
while S and U are associated with the axial currents and can be generated even by degen- 
erate heavy chiral multiplets. The contribution of a degenerate multiplet to S is 


pa D E (8.187) 


3m 
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where t3,, (tp) is the T? eigenvalue of 7,7, (WR). A heavy degenerate fourth family or mir- 
ror family would contribute 2/37 to S. Similarly, in technicolor models (Weinberg, 1979; 
Susskind, 1979) with scaled up QCD-like dynamics, one expects S ~ 0.45 for an SU(2) 
doublet of technifermions with Nrc = 4 technicolors, and S$ ~ 1.62 for a full technigener- 
ation (Peskin and Takeuchi, 1992; Golden and Randall, 1991). Most types of new physics 
yield U = 0. 

The effects of T on the precision observables are the same as po (using (8.186)). S and 
U mainly modify the SM expressions for the Z and W masses 


1— aT 
1 — GpM3,9/2V20 
1 
1 — Gr M? (S +U)/2V2n’ 


MŽ = M2, 
(8.188) 


Mw = Mivo 


where Mzo and Myo are the SM expressions in the MS scheme (see Patrignani, 2016, for 
the modifications to the widths, for which wave function renormalizations must be included, 
and to the WNC amplitudes). The global fit yields 


S =0.05(10),  T=0.08(12), U =0.02(10) 


S = 0.07(8), T = 0.10(7), (8.189) 


where U is fixed to be zero in the second line, consistent with the SM values S = T = U = 0. 

A heavy degenerate fourth or mirror family, which corresponds to T = 0, is strongly 
excluded. This complements the invisible Z width constraint below (8.170) on a fourth 
neutrino with mass < Mz/2. A heavy fourth family with large mass splittings may be 
marginally allowed by the precision data due to a compensation between the effects of S > 0 
and T > 0, though at the expense of large Yukawa couplings that may lead to Landau poles 
at low scales. However, in combination with direct search limits from the LHC and the 
properties of the Higgs,?° a perturbative fourth or mirror family is now excluded by more 
than 50 (Eberhardt et al., 2012). Similarly, the QCD-like technicolor models are excluded, 
but versions that do not resemble QCD in their dynamics may be allowed. Supersymmetric 
extensions of the SM usually give very small contributions to S and T (or to the precision 
observables). The implications for other types of new physics are reviewed in the Electroweak 
and Extra Dimensions articles in (Patrignani, 2016). 

The approximation that the vacuum polarizations are linear in q? breaks down for 
new physics that is not much heavier than Mz. The oblique parameter formalism can 
be extended to accomodate this by the inclusion of more parameters (Maksymyk et al., 
1994). The formalism can also be extended to include effects relevant to LEP 2 (Barbieri 
et al., 2004). The oblique parameters can also be redefined in terms of higher-dimensional 
operators (Barbieri et al., 2004; Han, 2008; Skiba, 2011). 


Summary 


The precision Z-pole, LEP 2, WNC, Tevatron, and LHC experiments have successfully 
tested the SM at the 0.1% level, including electroweak loops, thus confirming the gauge 
principle, SU(2) x U(1) group, representations, and the basic structure of renormalizable 
field theory. sin? Oy was precisely determined, m, was roughly predicted from its indirect 
loop effects prior to the direct discovery at the Tevatron, and the indirect value of as, 


25The production rate via gluon fusion would be increased by a factor ~9 because of the heavy quarks in 
the loop. See Section 8.5 and Problem 8.21. 
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Figure 8.28 Ay? = 1 allowed regions in the S-T plane for individual types of measure- 
ments and the 90% c.l. global fit region. The S and T shown here parametrize the 
effects of new physics only. Plot courtesy of the Particle Data Group (Patrignani, 
2016). 


extracted mainly from the Z-lineshape, agreed with more direct QCD determinations. The 
precision data also predicted the Higgs mass, though with large uncertainty, consistent 
with the value subsequently observed at the LHC and suggestive of supersymmetry. The 
agreement of the data with the SM imposes a severe constraint on possible new physics at the 
TeV scale, and points towards decoupling theories (such as most versions of supersymmetry 
and unification), which typically lead to 0.1% effects. On the other hand, generic versions of 
new TeV-scale dynamics or compositeness (e.g., dynamical symmetry breaking, composite 
Higgs, or composite fermion models) usually imply deviations of several %, and often large 
flavor changing neutral currents, although decoupling (or fine-tuned) versions may still be 
viable. Finally, the precisely measured gauge couplings were consistent with the unification 
expected in the simplest form of grand unification if the SM is extended to the MSSM 
(Section 10.4). Most of these results could be made even more precise by a high intensity 
Z-pole option for a possible future ete~ collider (e.g., Bicer et al., 2014). 


8.4 GAUGE SELF-INTERACTIONS 


The predicted gauge self-interactions of the SM, listed in Table 8.1, are essential probes of 
the structure and consistency of a spontaneously-broken non-abelian gauge theory. Even 
tiny deviations in their form or value would destroy the delicate cancellations needed for 
renormalizability, and would signal the need either for compensating new physics (e.g., from 
mixing with other gauge bosons or new particles in loops), or of a more fundamental break- 
down of the gauge principle, e.g., from some forms of compositeness. Experimental tests of 
the self-interactions are therefore important, although in practice many types of new physics 
that could modify them would show up sooner in the Z-pole and WNC experiments (e.g., 
De Rujula et al., 1992; Burgess et al., 1994). 
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In the SM, the triple gauge couplings (TGC) for WW Z and WW 7 are 


Wilpw(@QWwe(r): — ieCyure(p, 4,7) 


R 7 ; (8.190) 
W, (p) Ly (QWs (r) : ze cot OwCuvo (p, q, F) 
where the charges and momenta flow into the vertex and 
Cava (D; q, r) = Gavid P)o t Juo (P Pi F Jvo(T =; Gis (8.191) 


These have been constrained by measuring the total cross section and various decay distri- 
butions for e~et — W-W* at LEP 2, and by observing pp (pp) > WtW-,WZ, and Wy 
at the Tevatron (LHC). Possible anomalies in the predicted quartic vertices in Table 8.1, 
and the neutral cubic vertices for ZZZ, ZZy, and Zyy, which are absent in the SM, have 
also been constrained. 

The three tree-level diagrams for e~et — W-W* are shown in Figure 8.29. The cross 
section from any one or two of these rises rapidly with center of mass energy, but gauge 
invariance relates the gauge three-point vertices to their couplings to the fermions in such 
a way that at high energies there is a cancellation. It is another manifestation of the can- 
cellation in a gauge theory that brings higher-order loop integrals under control, leading to 
a renormalizable theory. It is seen in Figure 8.30 that the expected cancellations do occur. 


w wt wo. wt wo. w+ 
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i 


Figure 8.29 Diagrams for e~et + W- W+ in the SU (2) x U(1) model. 


More detailed information can be obtained from the angular distributions. It is useful to 
parametrize the TGCs in a form more general than the SM predictions. Neglecting terms 
that are irrelevant for light fermions, there are only seven possible Lorentz invariant form 
factors for the effective VW-Wt vertex, where V=y or Z (e.g., Hagiwara et al., 1993; 
Gounaris et al., 1996; Schael et al., 2013): 


Av 
2 
Mw 


iL” =g9yww [oY (Wa WY — Wi WY) +y WIW, V" + VYW PW, 


+ ig euvpo ((B°W-*) WHY — W= (0PWH)) V? + igy WE W, (0"V” + 0V”) 


k À 
-EWT WE Vo — o Wap W th eV ra, (8.192) 
myy 


where F,, = b Fy — bF, for F = WF, Z, or y is the non-interacting field strength. The 
form factors must be real (up to rescattering effects, which are small for a weakly coupled 
theory), and to first approximation one can neglect any momentum dependence and treat 
them as constants. The coefficients gyww are arbitrary normalizations, and can be set to 
the conventional values gyww = e and gzww = ecot Oy. The SM expectations are then 


=f =r=rz=1, àv = 9%; = Řv = iy = 0. (8.193) 
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Figure 8.30 Cross section for eet — W-W* compared with the SM expectation. 
Also shown is the expectation from t channel ve exchange only, and for the ve and 
y diagrams only. Reprinted with permission from (Schael et al., 2013). 


Going beyond the SM, the gY, ky, and Ay are C and P invariant, gy violates C and 
P but preserves CP, and the others violate CP. The C, P, or CP violating couplings are 
relatively weakly constrained at present and are usually neglected. It is reasonable to impose 
U(1)q invariance, since it is believed to be unbroken, in which case g? = 1 and gj = g} = 0. 
In fact, g], ky, and à can be interpreted in terms of the WF electromagnetic multipoles 


qw =eg}, pw =e(L+K,+Ay)/2Mw, Qw = -elk — Ay)/Mi, (8.194) 


which are, respectively, the W* charge, magnetic moment, and quadrupole moment, pre- 
dicted in the SM to assume the values e, e/Mw, and —e/Mĝ;. The stronger assumption of 
a global (custodial) SU (2) x U(1) symmetry in the limit g’ = 0, yields two more constraints 

kz = gf — tan? 0w (ky — 1), Ag = Ay: (8.195) 


Making all of these assumptions, one is left with three real parameters, Os Ky, and Ay, 
which are used in most analyses. (In fact, the data is usually analyzed allowing only one to 
vary from the SM value at a time.) The combined results of the four LEP 2 experiments 
are (Schael et al., 2013) 

gf — 1 = —0.016(19), Ky — 1 = —0.018(42), A, = —0.022(19), (8.196) 


in agreement with the SM expectations of zero. The ATLAS and CMS results from Run I 
were comparable or stronger. 


W Helicity Measurements 


The polarizations of produced W=, which can be measured by the angular distributions of 
their decay products in the W rest frame, probe the couplings of the W in the production 
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process. In particular, the fraction of longitudinally polarized W’s may differ from the SM 
prediction in models with admixtures of right-handed currents or in alternative models of 
spontaneous symmetry breaking (e.g., Chen et al., 2005). Polarization studies have been 
carried out at LEP 2 using e~et — W-W* and at the Tevatron and LHC using t > Wb. 
In t > bW* (with t at rest) one expects the fraction Fo of longitudinally polarized WT to 
approach unity for m; >> My, due to the large numerical value of the components of the 
longitudinal polarization vector e(pyw,3) ~ pw/Mw for Ew > Mw in Equation (2.127) on 
page 28, while for the actual masses one expects Fy = m?/(m? +2Mj,) ~ 0.70. (See (Kane 
et al., 1992) and Problem 8.15.) Similarly, the fraction F} of W* with helicity +1 should 
vanish for small ma due to the V — A coupling, so that F_ ~ 0.30. A recent CMS ob- 
servation (Khachatryan et al., 2016a) at 8 TeV yielded Fo = 0.681(26), F_ = 0.323(16), 
and F, = —0.004(15), to be compared with the full SM prediction, including NNLO QCD 
corrections (Czarnecki et al., 2010), Fo = 0.687(5), F- = 0.311(5), and F, = 0.0017(1). 


8.5 THE HIGGS 


The spontaneous symmetry breaking sector has always been the most uncertain part of 
the standard model. The simplest possibility is that the SSB is accomplished by the VEV 
of an elementary Higgs doublet, as described in Section 8.2.1, or by a more complicated 
Higgs sector involving two or more Higgs doublets (as in the supersymmetric extension 
of the SM) or other representations. It is also possible that the SSB is associated with a 
dynamical mechanism not involving elementary scalar fields, as will be touched on at the end 
of this Section and in Chapter 10. The observation of a Higgs-like boson?° with mass ~ 125 
GeV at the LHC in 2012 strongly supported the notion of an elementary Higgs doublet. 
However, the initial measurements of the Higgs couplings were not sufficiently precise to 
exclude some of the alternatives, especially limiting cases involving a boson very similar 
to the SM Higgs. In this section, we describe some of the theoretical constraints on the 
SM Higgs and some of its predicted properties for an arbitrary mass. We then discuss the 
searches for the Higgs, its discovery and study at the LHC, and the implications. For more 
detailed discussions, see (Gunion et al., 1990; Carena and Haber, 2003; Gomez-Bock et al., 
2007; Djouadi, 2008a; de Florian et al., 2016); the Higgs Boson review by Carena, Grojean, 
Kado, and Sharma in (Patrignani, 2016); and the sites devoted to Higgs production, decay, 
and experimental constraints listed in the websites section. 

In the standard model there remains one physical Higgs particle H after spontaneous 
symmetry breaking, with a potential given by (8.40) on page 265, and gauge, self-interaction, 
and Yukawa couplings to fermions given in Table 8.1. The mass is My = V2Xv, where À is 
the quartic self-coupling and v ~ 246 GeV is /2(0|¢°|0). The couplings to a particle of mass 
M are always proportional to M/v, M?/v, or (M/v)*. Thus, the couplings to fermions are 
small except for the top quark, making the H hard to produce or detect in that way. More 
optimistic are gauge couplings, such as VVH where V = W= or Z, which are proportional 
to gv ~ M2 /v. 


8.5.1 Theoretical Constraints 

The Higgs mass and quartic coupling are related by 

_ Mh; _ GrMĀ 
~ 8M3} y2’ 


26 That is, the SM Higgs boson or something closely resembling it. In the following it will also be referred 
to as the “Higgs boson” or the “Higgs”. 


Me = 20", à (8.197) 
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where v ~ 246 GeV and Gp ~ 1.2 x 10-°GeV? are known. The only constraint we have 
imposed so far on A is (tree-level) vacuum stability, A > 0, which would allow any My from 
0 to co. However, there are a number of more stringent theoretical constraints that lead to 
nontrivial upper and lower bounds. In this section we consider issues related to the Higgs 
and gauge boson self-interactions. Other theoretical issues, involving the Higgs contribution 
to the vacuum energy and the higher-order corrections to the Higgs mass, are discussed in 
Section 10.1. 


Renormalization Group Constraints 


The renormalization group equations for the running gauge couplings are given for an 
arbitrary gauge theory in (5.33) on page 170. For the SM the £ function one-loop coefficients 
in (5.34) are 


a a (1) 
90 ~~ 162 3 | Fona- 16g? 
1 [22 4F ny 1 19 
— Al 
bg 1672 E 3 6 | ram 1672 ( >) (5108) 


p, 1 [20F | nn ee a 
"1672 9 6 | F=3,nz=1 1672 6 


for SU(3), SU(2), and U(1), respectively.” These are valid at momenta much larger than 
m and v, and assume the existence of F fermion families and ny Higgs doublets. For F = 3 
and ny = 1, both SU(3) and SU(2) are asymptotically free. 

A and the Yukawa couplings also run. At one-loop (see, e.g., Cheng et al., 1974; Gunion 
et al., 1990) 


dA(Q? 1 3 
Z = [24x + 24Ah? — 24h? — 3A (3g? + g”) + 5 (204 + (9? +97)? 
(8.199) 
dhi(Q?) — 1 3 2 9 2 17 12 
din@? 32m? [oni he (89: +49 + 799 Jf 


where it is understood that the couplings on the r.h.s. are the running couplings at Q?. 
The quantity in (8.197) is the low energy value \(v?), while hi(v2) ~ m/v is the t-quark 
Yukawa coupling defined in (8.46) on page 267 or in (8.54); we have neglected all of the 
other Yukawas. The running of h; is due to vertex, t quark, and Higgs self-energy diagrams 
(for a scalar coupling there is no analog of the Ward-Takahashi identity described in Section 
2.12.1). Typical one-loop diagrams contributing to the running of are shown in Figure 
8.31. 

The first bound that we consider is the triviality upper limit on My (Cabibbo et al., 
1979). From (8.197), \(v) is larger than unity for My = 350 GeV, while h;(v?) ~ 0.7 for 
mMm ~ 173 GeV and the electroweak gauge coupling terms at low energy are small (Table 
1.1). For large My one can therefore approximate the A equation by the first term on the 
r.h.s., which is immediately solved to obtain 


A(n?) 
A s 
ACQ) = TAE eA a g : (8.200) 


27In considering grand unified theories it is conventional to introduce the GUT-normalized U(1) gauge 
coupling gi = VETE which has the coefficient bg, = 3b 1> Wee ( H 3). 


g 
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Figure 8.31 Typical diagrams contributing to the running of A. 


This diverges at the Landau pole 
Qrp = ve /3A0), (8.201) 


Presumably, it does not make sense for À to diverge within the domain of validity of the 
theory.”° It suffices that Qzp is larger than the scale A at which new physics sets in (and 
above which the RGE in (8.199) no longer apply). Requiring Qzp > A leads to the triviality 
limit 


3Gr In(A/v) O(650) GeV, A ~ 1500 GeV ’ (8.202) 


1/2 
var ( 2/2n? / { O(140) GeV, A ~ Mp 
) 

where Mp = Gf 2 ~~ 1.2 x 10!9 GeV is the Planck scale. This limit is somewhat fuzzy, 
because the one-loop approximation is not valid when A is large, and in fact perturbation 
theory breaks down. However, it provides a reasonable estimate (as a function of the new 
physics scale) of how large a value of My is consistent with having a weakly coupled field 
theory. In particular, the observation of a SM-type Higgs at a mass scale much larger than 
200 GeV would have strongly suggested that new physics sets in at a rather low scale. A more 
detailed evaluation including two-loop effects and the neglected terms in (8.199) is shown 
in Figure 8.32. One finds that My < 180 GeV for A ~ Mp, while My S 700 GeV for A < 
2M. (It would not make much sense to consider an elementary Higgs field for a lower A.) 
The latter upper bound can be justified by non-perturbative lattice calculations (Hasenfratz 
et al., 1987; Kuti et al., 1988; Luscher and Weisz, 1989), which suggest an absolute upper 
limit of 650 — 700 GeV. 

There is also a lower limit on the Higgs mass from vacuum stability (e.g., Cabibbo et al., 
1979; Degrassi et al., 2012; Espinosa, 2016). To see this, consider the small limit of the 
first equation in (8.199). If we keep only the (dominant) hf term,?° and also treat h; as a 
constant, then one finds 


3h4 2 
donean a (8.203) 
4r? n? 
which goes negative at Q? for which 
She. Q? aft Q 
Av?) = — hn Ss, M2, = —+— In—. 8.204 
(v ) An2 n y2 H V2n?GF n v ( ) 


?8In the pure AH4/4 theory the only way for the one-loop RGE solution to remain finite for Q? —> oo 
would be to take A(v?) = 0, justifying the term triviality. 
29Lower limits relevant to a much lighter t quark are reviewed in (Gunion et al., 1990). 
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Figure 8.32 Theoretical limits on the SM Higgs mass as a function of the scale A at 
which new physics enters. The upper limit is from the absence of a Landau pole 
below A, while the lower limit is from vacuum stability. Reprinted with permission 
from (Hambye and Riesselmann, 1997). 


A negative A within the domain of validity of the theory would suggest an unstable vacuum, 
so we have a lower limit on A(x?) and M?, coinciding with the terms on the r.h.s. in (8.204) 
for Q = A. In particular, My = 85 GeV for A = 1500 GeV. For large A it is not valid 
to neglect the running of h+. One needs to integrate the coupled equations, include two- 
loop effects, and include loop contributions to the effective potential in addition to those in 
\(Q?). Typical results are shown as a function of A in Figure 8.32. Combining the triviality 
and vacuum stability limits, My is constrained to the rather limited range 130—180 GeV for 
A = Mp, with a somewhat larger range for smaller A. The lower limit is weakened somewhat 
if one allows a sufficiently long-lived (i.e., longer than ~13.8 Gy, the age of the Universe) 
metastable vacuum (Espinosa and Quiros, 1995; Isidori et al., 2001). Early estimates allowed 
a limit in that case of around 115 GeV for A = Mp (but see the comments in Section 8.5.3). 

These limits do not apply in the minimal supersymmetric extension of the SM (the 
MSSM), because A is not an independent parameter and there are additional contributions 
to the RGE. There is a complementary upper limit of ~ 135 GeV on the lightest Higgs 
scalar in the MSSM (increasing to ~ 150 GeV in singlet extensions of the MSSM), which 
is close to the SM vacuum stability lower bound for most values of A. The observation of a 
Higgs much heavier or lighter than 135 GeV would have helped distinguish between the SM 
and supersymmetry, but the observed 125 GeV is inconclusive and somewhat challenging 
for both. 


Tree Unitarity and the Equivalence Theorem 


Another theoretical upper limit is based on the (tree-level) unitarity for WtW- > WtW- 
scattering and related processes such as ZZ, ZH, and HH scattering (Lee et al., 1977) (cf. 
the discussions of unitarity breakdown in high energy vee + vee and ete” — WtW- 
scattering in Section 7.1). The potential difficulty involves the longitudinal polarization 
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states for the W or Z, which dominate in high energy processes since the polarization vector 
x(k, 3) ~ k,/Mw,z grows with k at high energy. The amplitude for WWF —> W/W, 
(the subscript indicates longitudinal) is approximately 


t 
M = -iV2G pM? 8.205 
wae i (se + ma) can 


for s, Mj, > Mj,, with the relevant diagrams shown in Figure 8.33. If one takes My — 00 
this grows linearly with s, leading to violation of S-wave unitarity (Appendix C) at ys ~ 
(167\/2/G)'/? > 2.4 TeV, analogous to the problem in (7.6) on page 225 for vee > vee 
scattering in the Fermi theory. This illustrates why the existence of the Higgs (or some 
alternative form of spontaneous symmetry breaking) is essential to the consistency of the 
theory, since taking My — oo is equivalent to removing it from the theory. However, there 
are problems even for finite My. It is straightforward to show (Lee et al., 1977; Gunion 
et al., 1990) that the S-wave amplitude obtained from (8.205) grows with M?, for s > MẸ, 
i.e., 


Gr M? 
ao = -2E , (8.206) 
An /2 
where ao is the S-wave projection of the amplitude T = —iM, as defined in (C.5) and 


(C.7). The unitarity condition |a9| < 1 then leads to an upper limit on My. This can be 
strengthened somewhat by considering the coupled channel analysis and a more precise 
unitarity constraint, leading to the unitarity bound 


1/2 
Any/2 
My < ( 2) ~ 700 GeV, (8.207) 
3GF 


which is comparable to the lattice version of the triviality bound. 
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Figure 8.33 Tree-level diagrams for WtW- — W*W-. The zigzag lines can repre- 
sent a Z, y or H. 


Of course, (8.207) is based on the tree-level amplitude, and unitarity only applies rig- 
orously to the full amplitude. One should therefore interpret (8.207) as the condition for 
a weakly coupled (perturbative) Higgs and gauge self-interaction sector, for which higher- 
order corrections to the amplitude are not expected to be important. Conversely, a violation 
of (8.207), or the nonobservation of a Higgs below this scale, would have suggested that 
spontaneous symmetry breaking is associated with a strongly coupled Higgs sector or some 
strong coupling alternative to the elementary Higgs mechanism. This would presumably be 
manifested by enhanced WW cross sections at high energy (Chanowitz and Gaillard, 1985) 
and effects such as WW (bound state) resonances. 
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In Section 8.2.1 we applied the Kibble transformation to the Higgs doublet following 
SSB to go to the unitary gauge, in which it is manifest that the Goldstone bosons are 
eaten to become the longitudinal degrees of freedom of the WF and Z. We also saw above 
that amplitudes involving the W and Z at high energy are dominated by their longitudinal 
components. It is therefore not surprising that such high energy amplitudes can be calculated 
more easily in terms of the original Goldstone degrees of freedom using the equivalence 
theorem (Lee et al., 1977). (More rigorous discussions in more general gauges and including 
higher-order effects are given in (Chanowitz and Gaillard, 1985; Chanowitz et al., 1987).) We 
will illustrate this with a simple example, using the expressions (8.85) on page 277 for the 
Higgs doublet ¢ and (8.88) for the Higgs potential V(@) in an Re gauge, in which H is the 
physical Higgs field, and w* and z are the Goldstone degrees of freedom that disappear in 
the unitary gauge. (The gauge and Yukawa interactions for ¢ are also given in Section 8.2.4.) 
The equivalence theorem states that amplitudes involving high energy longitudinal Z’s and 
W’s can be obtained by the much simpler calculation of the corresponding amplitudes 
involving z and w. As a simple example, the wtw~ — wtw7 amplitude is given at tree- 
level by the four-point ww wtw" vertex derived from (8.88), and by H exchange in the 
s and ¢ channels. One finds 


M = —4id + (—2idrv)? 


a a s t 
= —2i 8.208 
ae tic ' a eae o (8.208) 


which reproduces (8.205) after using (8.197). 


8.5.2 Higgs Properties, Searches, and Discovery 


The Higgs interaction vertices are displayed in Figure 8.3 on page 267. They are always 
proportional to the mass of a fermion or to mass-squared for a boson, making the Higgs 
difficult to produce or detect in processes involving light particles. Here we consider searches 
for a Higgs boson anywhere in the mass range My Z O(10 GeV), and then specialize to 
the observed value My ~ 125 GeV. Searches for a lighter Higgs in nuclear physics, meson 
decays, etc., are described in (Gunion et al., 1990). 


Higgs Production 


There are a number of production mechanisms for the Higgs at hadron colliders, some in 
association with other particles, as indicated in Figure 8.34. The gluon-gluon fusion (GGF) 
mechanism GG — H, which proceeds via a virtual t-quark loop, has the largest cross section 
at both the Tevatron and the LHC. Observation of gluon-gluon fusion allows an indirect 
constraint on the ttH coupling, even for My < 2m,. There are other modes with lower 
rates than gluon-gluon fusion, but with the advantage of allowing tagging on the associated 
particles. For example, the associated production of W*H or ZH (Higgstrahlung) is im- 
portant. The ZH mode can be tagged by Z > £~£* with £= e or p, allowing a constraint 
on the ZH rate even for otherwise difficult modes such as H — bb or for (non-standard) 
Higgs decays into unobserved particles. WH —> v@H and ZH — viH have even larger 
rates. Vector boson fusion (VBF), i.e., W*W- > H or ZZ > H with the W or Z radiated 
from a q or q, has the second highest rate at the LHC and is associated with two hard jets. 
Associated ttH and bbH production is also important at the LHC. QCD (and electroweak) 
radiative corrections can be large. This is especially true for gluon fusion, where they are of 
O(100%) at the LHC. They are described in the reviews and summarized in (Heinemeyer 
et al., 2013; de Florian et al., 2016, and Problem 8.20). The various production cross sec- 
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tions, including higher-order corrections, are shown for pp collisions as functions of My and 
ys in Figure 8.35. 
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Figure 8.34 Representative Higgs production diagrams at a hadron collider, including 
gluon-gluon fusion (left), WW or ZZ fusion (second left), and typical diagrams for 
associated production of WH, ZH, or ttH. 
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Figure 8.35 Higgs production cross sections at the LHC as a function of My 
for ys = 14 TeV (left), and as a function of ys for My = 125 GeV 
(right). Figures courtesy of the LHC Higgs Cross Section Working Group, 
twiki.cern.ch/twiki/bin/view/LHCPhysics/LHCHXSwG (de Florian et al., 2016). 


The dominant production mechanism in e~e* at lower energies (e.g., LEP) is through 
associated Z production (Higgstrahlung), e~et > Z > ZH, analogous to the third diagram 
in Figure 8.34. At higher energies (ys = 450 GeV for My = 125 GeV), the WW fusion 
process eet —> y. W Wty, > VeDeH (analogous to the second diagram in Figure 8.34) 
dominates because the cross section scales as In(s/Mj,) rather than 1/s (see page 219). 
ZZ fusion, e~e* + e~e*H, is cleaner than WW fusion, but the cross section is an order 
of magnitude smaller. Smaller still is the cross section for e~e+ — ttH. However, this is 
dominated by the radiation of the Higgs from the t or ¢ produced in e~et — tt and could 
allow the determination of the ttH coupling in a future high energy collider. The predicted 
e~e* cross sections for My = 125 GeV are shown in Figure 8.36. 
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Figure 8.36 Left: SM cross sections for eet 4 HX (X = Z,vp,e™ et, tt,- ) 
for My = 125 GeV as a function of ys, courtesy of CERN (Le- 
brun et al, 2012). Right: branching ratios for SM Higgs decays as 
a function of My, courtesy of the LHC Higgs Cross Section Work- 
ing Group, twiki.cern.ch/twiki/bin/view/LHCPhysics/LHCHXSWG (Heinemeyer 
et al., 2013). 


Higgs Decays 


The standard model expectations for the various Higgs decay modes, including the details 
of the QCD and electroweak radiative corrections, are reviewed in (Djouadi, 2008a; Denner 
et al., 2011; Almeida et al., 2014; Lepage et al., 2014). 

The predicted Higgs decay branching ratios are shown as a function of My in Figure 8.36. 
For large My, the decays H — WtW- and H —> ZZ dominate. The cleanest signature 
in this case would be for the “golden” mode H > ZZ — 4, but this is suppressed by the 
low leptonic branching ratios. The chains H > WW —> qĝlv and H > ZZ —> qqll would 
therefore also be critical. Below the WW threshold, down to ~ 135 GeV, the decay WW*, 
where W* is off-shell, dominates, and ZZ* is also important. For still lower masses H — bb 
has the largest branching ratio. However, this mode is difficult at hadron colliders because 
of the very large QCD background, especially at the LHC. Thus, a Higgs produced by gluon 
fusion would be unobservable in bb. The most promising light Higgs channels at the LHC 
are H — yy, which proceeds at one-loop, and H > ZZ* — 4¢. These have very small 
branching ratios, but have clean signatures and lower backgrounds than the other modes, 
and in fact these turned out to be the discovery channels. Other modes with significant 
branching ratios for a light Higgs include GG, rtr, and cé. Associated productions of the 
Higgs with W, Z, or tt, as well as vector boson fusion with hard forward quark jets, provide 
powerful additional handles, e.g., allowing the observation of H — bb at hadron colliders. 

The partial Higgs decay widths into fermions are 

= Grmi 


M(H ff) = 0,7 


where Bf = (1— 4m3 M)? is the fermion velocity and Cp = 1 (leptons) or 3 (quarks) is 


b} Muy, (8.209) 
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the color factor. The dominant QCD corrections for quarks are included by evaluating the 
running masses at M7, e.g., mp[(125 GeV)?] ~ 3 GeV, reducing the width from the leading 
order expression. There are additional (smaller) corrections, e.g., from vertex diagrams. 
These fermionic widths are very small (except H — tt for a very heavy H). For example, 
T(bb)/My ~ 1.9 x 1075, so that T(bb) ~ 2.4 MeV for My = 125 GeV. The partial widths 
for on-shell WW- or ZZ are 


Gr 
16/27 
where ôw = 2, dz = 1, and zy = 4M?/M},. The partial width grows as M}. This is 


because for My >> My the decay is dominated by the longitudinal vector states, and their 
polarization vectors are of order My/My (see Problem 8.25). Asymptotically, 


I'(H > VV1) = dy (1—a2y)/? (1 -zv + it) Mẹ}, (8.210) 


1/ Mg \* 
+y- aff eee 
r(H > WW 4+ 2Z)/My 5 G t) F (8.211) 
so a heavy Higgs is expected to be very broad, while a light one is very narrow. The partial 
widths for virtual decays, [(H — VV*) are given, e.g., in (Gunion et al., 1990; Djouadi, 
2008a). 
The decays H > yy, Zy, and GG occur at one-loop. At leading order 


Gra? M} 
36/273 


for My « 2m:, from the top-loop diagram in Figure 8.37, with a, evaluated at My. There 
is a very large QCD correction. At the next order (Spira et al., 1995) the rate is enhanced 
by 1+ da,/m, where 6 ~ 95/4 — 7N;/6, i.e., an increase of ~ 60% for Nr = 5. The GG 
decay is of course difficult to observe at a hadron collider but can be probed indirectly by 
measuring the GG fusion rate. Similarly, the H — yy decay is dominated by the W and 
top loops in Figure 8.37, which partially cancel. For My < 2My, the SM prediction is 


I(H > GG) ~ (8.212) 


2 


G 2 M3 
po% VH ; (8.213) 
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where the first term is from the W, while the 4C;q?7/3 = 16/9 is from the top. See the 
reviews for corrections for finite My, higher order corrections, and [(H > Zy). 
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Figure 8.37 Diagrams for the loop-induced decays H + GG and H > yy. 
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Precision Electroweak Constraints 


As described in Section 8.3.4, the precision electroweak data depend logarithmically on 
My due to the loop contributions to the gauge self-energies, shown in Figure 8.17 on 
page 301. These favor 66 < My < 134 GeV at 90% cl, with a central value My= 96173 
GeV (Patrignani, 2016), consistent with 125 GeV at 1.30. The allowed region in the m:-My 
plane is shown in Figure 8.26 on page 316. 


Direct Searches at Colliders®© 


The Higgs was searched for directly at LEP, through the Higgstrahlung process e~et — 
Z — ZH, which probes the ZZH vertex in Figure 8.3. At LEP 1, the s-channel Z was 
on-shell and the final one virtual, while at LEP 2, the s-channel Z was virtual. Various 
combinations of final states were searched for, including H — bb or 7~ 7+ and Z > qq, 0 4*, 
or vv. The LEP experiments were able to exclude a SM Higgs with mass below 114.4 GeV 
at 95% cl (Barate et al., 2003; Patrignani, 2016). LEP 2 also obtained limits on possible 
Higgs-like states with masses below 114.4 GeV but with reduced rates compared to the SM 
expectation, as would be expected in some extended models. 

The CDF and DO experiments at the Tevatron could probe to higher values of the 
SM Higgs mass once they had acquired sufficient integrated luminosity. Initially, they were 
mainly sensitive to H + WW- in the mass range 160-170 GeV near or just above 
threshold. With increased luminosity their sensitivity increased and they could search for 
lower Higgs masses via associated VH production (V = W or Z), with H — bb (the 
Tevatron bb background is not so large as that at the LHC). By the time the Tevatron 
ceased running in 2012 the collaborations had searched for the SM Higgs in the range 
90-200 GeV in the decay modes bb, WtW-, ZZ, 7~ 7+, and yy at ys = 1.96 TeV with 
integrated luminosities L = f £dt up to 10 fb~', taking into account all of the production 
mechanisms in Figure 8.34. Their combined analysis (Aaltonen et al., 2013) excluded a SM 
Higgs in the mass ranges 90-109 and 149-182 GeV, and observed a broad excess of events 
(mainly Vbb) from 115-140 GeV. Taking resolution into account this was consistent with 
the LHC observations and with a SM Higgs, with a local significance’?! of 3.00 for My = 125 
GeV. 


The Higgs Discovery and Properties 


Because of the higher energy (and therefore cross sections), and eventually the larger inte- 
grated luminosity, the LHC experiments ATLAS and CMS were sensitive to a SM or MSSM 
Higgs over the entire relevant mass range. In Run 1 each of the two experiments obtained 
L~5 fb! at /s = 7 TeV during 2011 and ~ 20 fb~! at y/s = 8 TeV in 2012. 

The 7 TeV data, combined with the earlier collider experiments, was sufficient to exclude 
a light SM Higgs below 116 GeV or a heavy SM Higgs between 127 and 600 GeV, leaving only 
a relatively small window (which was also in the range expected from precision electroweak 
measurements). By the end of 2011 both experiments reported significant excesses around 
120-126 GeV in both the yy and 44, £ = e* or w* channels. 

By the summer of 2012 there were sufficient statistics for CMS (Chatrchyan et al., 2012) 
and ATLAS (Aad et al., 2012) to announce > 5ø discoveries of a Higgs-like particle of mass 


30The history is described in, e.g., (Bernardi and Herndon, 2014; Dittmaier and Schumacher, 2013). 

31 The local significance is for a definite Mz, while the lower global significance takes into account the 
look-elsewhere effect, i.e., that a statistical fluctuation of the background could have occurred anywhere in 
a given mass range. 
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~(125-126) GeV. The most significant channels were again yy and 4¢ (from H > ZZ* > 
4¢), for which there is the best mass resolution. The final combined ATLAS-CMS analysis 
of the Run 1 data yielded My = 125.09 + 0.24 GeV (Aad et al., 2015a). 

The Higgs production and decay properties were extensively studied by ATLAS and 
CMS in Run 1, as reviewed in, e.g., (Murray and Sharma, 2015; Aad et al., 2016; Patrignani, 
2016). They established not only the yy and ZZ* decays, but also H + WW*, which is 
observed as a broad enhancement in 2¢ (with the neutrinos unobserved), and H + T+T. 
There is also some evidence (at about 2c) for H — bb (mainly from associated ZH and 
WH production), as well as upper limits on rare decay modes such as y“ wt and Zy. 

The various production mechanisms can be separated experimentally, e.g., by tagging 
on the associated forward jets, W, Z, or t. Most of the combinations o,By have been 
determined, where g; represent the cross sections for gluon-gluon fusion, vector boson fusion, 
or associated WH, ZH, or ttH production, and By =T'y/Iy is the branching ratio into 
f=, ZZ*, WW*, Ttr", or bb. The results for the ratios of a; By; to the SM expectations 
are shown in Figure 8.38. They are consistent with unity, though with large uncertainties. 
A global analysis over all i and f yields a a rate of 1.09(11) relative to the SM. 
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Figure 8.38 Signal strengths u (ratio of observed rate to the expectation for a SM 
Higgs) for various production processes and subsequent decay modes for the com- 
bined ATLAS and CMS analysis of Run 1 data, from (Aad et al., 2016). 
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The basic observables o;By at a hadron collider do not allow a separation between the 
cross sections and branching ratios without additional information, such as a theoretical 
calculation of the cross section.?? However, the relative branching ratios into two different 
final states, the ratios of production cross sections, or the ratios of events at different energies 
can be determined. All agree with the SM within uncertainties. 

An alternative way of analyzing the data is to assume the SM production and decay 
mechanisms, but to allow the elementary WW H, ZZH, ttH, bbH, and r+7~ H couplings 
to vary by factors k compared to the SM expectations. All of the «’s are consistent with 
unity within the (10-20)% uncertainties except for bbH, for which xe ~ 0.671922 (Aad 
et al., 2016).°3 These results are displayed in Figure 8.39, in which the observed couplings 
are seen to agree with the SM prediction that they should be proportional to a power of 
mass, i.e., to hy = my/v for fermion f, or to hy = 2M?/v for V = W, Z. 
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Figure 8.39 Plot of the observed tree-level Higgs couplings to fermions (Km f/v) and 
gauge bosons (,/Ky My /v) as a function of mass, compared with the SM prediction 
kf = ky = 1 (dashed line). Courtesy of CERN. 


Some beyond the standard model scenarios predict Higgs decay modes into invisible 
or unobserved final states, such as MSSM neutralinos or dark sector candidates for dark 
matter, or into a pair of pseudoscalars in some models with extended Higgs sectors. These 
are constrained by tagged events such as Higgstrahlung, vector boson fusion, or gluon fusion 
with a monojet from initial state radiation, yielding limits of ~(60-70)% at 95% c.l. on the 
branching ratio for H — invisible. A more stringent upper limit of 34% is obtained in a 


32Tn principle the cross sections could be separated by tagging some inclusive production mode, such as 
associated WH or ZH, with W or Z > jets, Z > 2£, or Z — bb. 
33There are additional solutions with different signs due to interference effects. 
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generalization of the global analysis with arbitrary couplings, allowing for invisible decays 
and new particles in the loops. 

A direct determination of the total width [7 would be extremely desirable, both for 
relating branching ratios to absolute partial widths and for further constraining unobserved 
or invisible decay modes. CMS obtained an upper limit of 3.4 GeV at 95% c.l. on TH 
from the 44 lineshape. This is very much larger than the intrinsic width of ~ 4.1 MeV 
expected in the SM, and is presumably due almost entirely to the experimental resolution. 
CMS and ATLAS subsequently obtained much more stringent limits Ty < 22 MeV at 
95% c.l. indirectly by comparing the H + ZZ* and WW* rates on-shell and off-shell, 
making use of the Breit-Wigner energy dependence in (F.2) on page 523, and assuming 
no extra decay modes are accessible off shell. This is still considerably larger than the SM 
expectation, however. A precise determination of IT y will probably require a measurement 
of the e~et — ZH > ZZZ* rate at a future e e* collider (to obtain the ZZH vertex and 
therefore r(H > ZZ*)), combined with B(H > ZZ*). 

Another critical aspect is to determine the spin, parity, and charge conjugation quan- 
tum numbers JPC. These are predicted to be 0+*+ for the SM Higgs, i.e., the couplings to 
fermions and vectors are proportional to Hyw and H VV" (or HV, V”” at loop level), 
respectively. However, dynamical alternatives to the elementary Higgs mechanism often in- 
volve pseudoscalars, JPO = 07+, which could be light (Eichten et al., 2012). A pseudoscalar 
P would couple like Pyy°>q) and PV V”, where V” = eae fre If a light pseudoscalar 
somehow had the appropriate coupling strengths it could mimic the SM Higgs. In principle, 
the Higgs-like particle could also have spin higher than 0. 

The fact that the H decays to yy implies that it has nc = +1 (or that it is not a 
C eigenstate). It also implies that J = 0,1, or 2 under the plausible assumption of an 
S-wave decay. CMS and ATLAS have studied the various possibilities using the observed 
distributions in the yy, ZZ*, and WW* decays. All data are consistent with the 0+T 
assignment, and many alternative models involving J? = 0~,1*, and 2+ are excluded at 
better than 99% c.l. 


8.5.3 Implications of the Higgs Discovery 


The various observations described in the previous section leave little doubt that the ob- 
served state is either the SM Higgs, or something very similar. Assuming that it really is 
the SM Higgs, its relatively low mass implies some tension with the the vacuum stability 
considerations discussed in Section 8.5.1. A recent NNLO calculation of the effective po- 
tential and running couplings (Degrassi et al., 2012; Buttazzo et al., 2013) shows that for 
the observed parameters the quartic coupling A(Q?) goes negative for Q around 10/° — 1017 
GeV. See Figure 8.40 (left). However, the decrease of À slows for larger Q? due to cancella- 
tions, implying a metastable vacuum (with the lifetime longer than the observed age of the 
Universe.) It is remarkable that the observed parameters imply that the SM vacuum lies 
in the narrow strip between instability and absolute stability. In particular, the theoretically 
intriguing possibility A( M2) = 0 is excluded at better than 99% c.l. (Buttazzo et al., 2013). 


34 According to the Landau-Yang theorem (Landau, 1948; Yang, 1950), which follows from rotational 
invariance and Bose statistics, a spin-1 particle cannot decay into two identical massless vectors. It can be 
evaded for off-shell, massive, or non-identical vectors, for processes that are mistaken for two photons, or 
by off-shell or interference effects for the decaying particle. It is therefore useful to exclude this possibility 
experimentally. 

35 However, the decay rate could be greatly increased in the presence of Planck-scale higher-dimensional 
operators (e.g., Lalak et al., 2014; Branchina et al., 2015) or if they are nucleated by small black holes (Burda 
et al., 2016). 
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It should be reiterated that these considerations only apply if there is no new physics below 
the Planck scale Mp, or at least below the scale at which vanishes. 
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Figure 8.40 Left: the running of the quartic Higgs self-coupling for My = 125 GeV. 
The vacuum remains metastable for A(M?2) > —0.05. Right: phase diagram for the 
standard model as a function of My and m+, assuming that there is no new physics 
up to the Planck scale. From (Degrassi et al., 2012). 


Extended Electroweak Symmetry Breaking Sectors 


The observed Higgs-like particle is consistent with the SM Higgs. However, the measure- 
ments of its couplings still leave considerable room for deviations. These could be due either 
to extended sectors involving elementary Higgs fields or to the possibility that the Higgs 
sector is an effective theory describing underlying strong dynamics at a higher scale. Some 
of these extensions are motivated by the Higgs/hierarchy problem described in Section 10.1. 
This basically states that the Higgs mass-squared involves quadratically-divergent correc- 
tions of O(\,g?,h?)A? (see (10.2) on page 427), where A is the new physics scale. This 
implies that for A >> TeV there must be fine-tuned cancellations between the bare mass- 
squared and the corrections. 

Many theories beyond the SM have extended Higgs sectors,°° involving additional Higgs 
doublets (such as are required in supersymmetry) and/or Higgs fields transforming under 
different SU(2) representations, such as singlets or triplets (for reviews, see Gunion et al., 
1990; Djouadi, 2008b; Accomando et al., 2006; Barger et al., 2009; Maniatis, 2010; Ellwanger 
et al., 2010; Branco et al., 2012; Patrignani, 2016). Such additional multiplets allow new 
physical spin-0 particles, including extra electrically-neutral states, singly-charged states 
for additional doublets or triplets, and even doubly-charged states for SU(2) triplets with 

= +1. 

Electrically neutral states can mix with the H (even for SU (2) singlets), leading to mass 
eigenstates h;. In some cases, there are two or more h; with scalar couplings to fermions, and 


36We are using Higgs in the sense of any spin-0 color-singlet multiplets that are not forbidden by additional 
symmetries from mixing with the SM Higgs doublet. 
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one or more with pseudoscalar (7°) couplings, and in other cases (involving CP violation) 
the scalars and pseudoscalars mix. In the MSSM, for example, there are two Higgs doublets. 
In the decoupling limit, in which the second Higgs doublet is much heavier than the first, 
there is little mixing and the lighter scalar has couplings essentially the same as those of the 
SM Higgs. However, as will be discussed in Chapter 10, there can be considerable mixing 
between the two scalars (and with the pseudoscalar if CP is violated) in the non-decoupling 
region, in which the new supersymmetric particles and second Higgs are relatively light. 

Mixing will modify the couplings of the mass eigenstates. For example, the overall 
strength of the Yukawa couplings of the up-type quarks are changed relative to the SM 
and to those of the down-type quarks and charged leptons in the MSSM and other two- 
doublet models (Section 10.2.5). Mixing can also lead to new invisible or exotic decay modes 
(increasing the width), or can decrease the width due to singlet mixing. 

Many of these possibilities have been reexamined in light of the discovery of the Higgs- 
like particle, including models with additional singlets (Frank et al., 2013; Robens and 
Stefaniak, 2015) or doublets (Baglio et al., 2014; Craig et al., 2015). It has already been 
commented in Section 8.5.1 that there is a theoretical upper limit of ~ 135 GeV on the mass 
of the lightest Higgs scalar in the MSSM, due to the fact that the analog of A in the SM is 
given by gauge couplings. Furthermore, even 125 GeV requires either very large stop masses 
in the multi-TeV range or large stop mixing. These issues, as well as some singlet-extended 
versions in which the upper limit is relaxed, are discussed in Section 10.2.5. 

In addition to the implications for the Higgs spectrum and couplings, extended Higgs 
sectors can affect the origin of the fermion mass hierarchy and/or lead to flavor changing 
Higgs-fermion couplings (e.g., in non-supersymmetric two-doublet models in which both 
doublets couple to the same fermions). They can also modify the po parameter (i.e., to 
the Mw /Mz relation) at tree level due to the VEVs of Higgs triplets or higher-dimensional 
representations (Problem 8.1) unless a custodial symmetry is somehow imposed (e.g., Logan, 
2014). There are also new mechanisms for explicit or spontaneous CP violation in the Higgs 
sector, and for a strongly first-order electroweak phase transition, both of which are relevant 
to the possibility of electroweak baryogenesis (e.g., Maniatis, 2010; Ellwanger et al., 2010; 
Huang et al., 2016)). There is even the danger of the spontaneous violation of electric charge 
conservation (Problems 8.3 and 8.4). Higgs singlets in extended models can also be dark 
matter candidates (Barger et al., 2009) or can serve as a portal connecting the standard 
model fields to a dark matter sector (Schabinger and Wells, 2005; Patt and Wilczek, 2006). 
Finally, Higgs singlets may offer a dynamical solution to the u problem of the MSSM 
(Section 10.2.6). 

It is also possible that there are no truly elementary Higgs particles. For example, elec- 
troweak symmetry breaking could be associated with the boundary conditions in a theory 
with extra dimensions of space-time [Higgsless models (Csáki et al., 2004)], for which there 
are no analogs of the Higgs field. The observation of the 125 GeV Higgs-like state implies 
that such models are no longer relevant to electroweak physics, but the ideas could play a 
role in, e.g., grand unified theories in extra dimensions. Similar statements apply to dynam- 
ical symmetry breaking mechanisms (Hill and Simmons, 2003). These are based on a new 
strong dynamics and usually do not have a light 0* scalar that could imitate the SM Higgs. 
In technicolor, for example, the SSB is associated with the expectation value of a fermion 
bilinear, analogous to the breaking of chiral symmetry in QCD (Section 5.7). Extended 
technicolor and top-color also fall into this class. 

More promising are composite Higgs models. Typically, these involve a more fundamen- 
tal strongly-coupled sector at the 1-10 TeV scale which (unlike technicolor) does not directly 
lead to electroweak symmetry breaking. However, the Higgs multiplet emerges as a compos- 
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ite state in a low-energy effective theory. A very attractive possibility is that, in the absence 
of electroweak or Yukawa couplings, all four Hermitian components of the Higgs doublet are 
the massless Goldstone bosons associated with a spontaneously broken global symmetry of 
the underlying theory. Turning on the electroweak couplings generates the Higgs potential, 
which in turn leads to SU(2) x U(1) breaking. Three of the Goldstone bosons are absorbed 
by the ordinary Higgs mechanism, while the Higgs-like scalar is a pseudo-Goldstone bo- 
son. The Higgs mass is generically suppressed by O(g/47) compared to the scale A of the 
strong dynamics and of the other composite states. This suggests A = O(1 TeV), which 
is rather low for evading precision electroweak and FCNC constraints. A larger and safer 
A, e.g., 10 TeV, can be achieved by fine-tuning. Alternatively, A naturally increases by an 
order of magnitude (from an extra factor of g/47) in Little Higgs models (Arkani-Hamed 
et al., 2002a), in which the Higgs mass is protected by two symmetries.°’ Agreement with 
precision constraints is further improved by the imposition of a discrete T-parity (Cheng 
and Low, 2003). Composite Higgs models can often be reinterpreted (i.e., are equivalent to) 
theories with extra space-time dimensions. In these dual descriptions the massless Higgs 
fields are identified as gauge fields in the extra dimensions. These ideas, related issues such 
as the generation of Yukawa couplings, and other dynamical mechanisms are reviewed in 
more detail in (Perelstein, 2007; Csáki et al., 2016; Panico and Wulzer, 2016) and in the ar- 
ticles on Status of Higgs Boson Physics and on Dynamical Electroweak Symmetry Breaking 
in (Patrignani, 2016). See also Section 10.1. 

The various types of new physics mentioned above can affect the Higgs in a number of 
ways. As already mentioned, mixing in extended Higgs sectors can modify the couplings 
of the mass eigenstates. Mixing and other new physics can also change the width and 
allow new exotic or invisible decay modes (Chang et al., 2008; Curtin et al., 2014). New 
physics can lead to Higgs-mediated flavor changing effects (Buras et al., 2010a; Blankenburg 
et al., 2012), or can modify the Higgs self-interactions substantially, especially the induced 
H? coupling (Problem 8.22). Heavy particles such as superpartners, additional quarks and 
leptons, and heavy W’ bosons are expected in many extended theories, including most of the 
composite Higgs models. These can enter into loops and significantly modify gluon fusion 
and such loop-induced decays as GG, yy, and Zy, or perturb tree-allowed decays such as 
H — bb. Of course, there may be associated effects such as the direct production of the 
new heavy particles at the LHC, modification of the oblique parameters or other aspects of 
precision electroweak physics, the observation of FCNC, or the modification of high energy 
VV’ scattering, where V,V’ = W or Z (see, e.g., Baak et al., 2013; Szleper, 2014; and 
Section 8.4). 

Possible deviations from the SM predictions for the Higgs couplings have been studied 
quantitatively in specific classes of models and from a model independent or general ef- 
fective operator framework, e.g., in (Giardino et al., 2014; Bélanger et al., 2013; Contino 
et al., 2013; Elias-Miro et al., 2013; Herrero, 2015; Ellis et al., 2015; Englert et al., 2014). 
Typical deviations are in the 1-10% range. Considerable improvement on the current ob- 
servations shown in Figure 8.38 are expected from future running at the LHC, especially 
with a luminosity upgrade. These could reach a precision of ~(5-10)%. Even more precise 
measurements would be possible at a future e~e* collider, such as the International Lin- 
ear Collider (ILC), proposed to be built in Japan. For example, the ILC running at 250 
GeV would be able to determine the total H width by measuring the total e~e* + ZH or 
e-e* — ZH > ZZZ* rate and combining it with the H + ZZ* branching ratio. Increasing 
the ILC energy to 500-1000 GeV would allow studies of WW fusion and measurements of 


37 Operationally, the Little Higgs models involve additional heavy vectors, fermions, and scalars that cancel 
the one-loop quadratic divergences. 
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the other couplings, including the ttH and H? couplings, many at the 1% level. Of course, 
the theoretical predictions of the SM would have to be computed to comparable preci- 
sion (Denner et al., 2011; Almeida et al., 2014; Lepage et al., 2014). High intensity circular 
ee? colliders have also been proposed at CERN (the FCC-ee) and in China (the CEPC), 
and a very high energy (several TeV) linear collider (CLIC) at CERN. Future prospects for 
various facilities are surveyed in (Dawson et al., 2013; Asner et al., 2013; Bicer et al., 2014; 
Bechtle et al., 2014b; Arbey et al., 2015). 


8.6 THE CKM MATRIX AND CP VIOLATION 


We saw in Section 2.10 that CP violation can occur in a field theory whenever the La- 
grangian density involves more complex parameters than can be removed by field redefini- 
tions. Nevertheless, CP violation is observed to be very tiny, and it came as a surprise to 
most physicists when such a violation was first observed in 1964 (Christenson et al., 1964). 
It is straightforward to write down phenomenological models of new interactions that acco- 
modate CP violation (for a review of early attempts, see Commins and Bucksbaum, 1983). 
However, when the SM was developed it was difficult to directly incorporate CP violation, 
because for one or two families the SM is sufficiently simple that CP emerges as an acciden- 
tal symmetry. One possibility was to extend the Higgs sector, allowing C P-violating effects 
associated with the scalar exchanges (e.g., Weinberg, 1976). In this case the weakness of CP 
breaking would be attributed to the small Higgs Yukawa couplings. Another was to assume 
the existence of an entirely new very weak interaction to mediate the CP violation, such 
as in the superweak model (Wolfenstein, 1964). A third possibility, apparently realized in 
nature, is to introduce a third fermion family (Kobayashi and Maskawa, 1973). Observable 
CP violation would only occur when all three families are relevant to a process, accounting 
for the weakness. 

On the other hand, most extensions of the SM have potential new sources of CP violation 
that are not naturally suppressed, so CP violation studies are important. Moreover, C.P 
violation is necessary to explain baryogenesis. It is likely that the origin of the baryon 
asymmetry is related to CP phases beyond those in the CKM matrix, possibly those related 
to neutrino mixing or perhaps associated with BSM effects observable at the LHC. 

This section will give a brief introduction to CP violation and mixing effects, mainly in 
the K and B meson systems, and the closely related status of the CKM matrix. Much more 
detailed treatments may be found in (Commins and Bucksbaum, 1983; Jarlskog, 1989; 
Branco et al., 1999; Kleinknecht, 2003; Ibrahim and Nath, 2008; Sozzi, 2008; Bigi and 
Sanda, 2009; Schubert, 2015; Patrignani, 2016). CP and time reversal (T) invariance are 
equivalent in a C'PT-invariant theory in the sense that any C’P-odd operator is also T-odd. 
Nevertheless, CP and T are different transformations with different consequences (e.g., 
Bernabéu and Martinez-Vidal, 2015). Recent direct observation of T-violation involving 
K and B oscillations, and constraints from electric dipole moments, will be considered in 
Section 8.6.5. Strong CP violation, associated with the parameter Ogcp introduced in (5.2), 
and baryogenesis will be touched on in Section 10.1. 

Most of the processes considered involve nontrivial complications from the strong in- 
teractions. Enormous effort involving perturbative techniques such as HQET and SCET, 
as well as non-perturbative lattice calculations and chiral perturbation theory have been 
required to overcome those obstacles. 
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8.6.1 The CKM Matrix 


In Section 8.2.2 the quark and lepton mixing matrices V, and V; were seen to arise from the 
mismatch between the fermion gauge and Yukawa interactions, i.e., between the weak and 
mass eigenstates. They also required a mismatch between the u and d (or e and v) sectors, 
as is apparent in (8.60) on page 272. The empirical forms of V, were briefly discussed for 
two and three quark families in (8.65) and (8.66), and in Chapter 7. Here, we consider V, 
for three families, i.e., the CKM matrix, in more detail. 

As already described, after imposing unitarity and removing unobservable phases by 
redefinitions of the chiral quark fields, V} involves three mixing angles and one C’P-violating 
phase. There are a number of possible parametrizations, but we will follow that of the 
Particle Data Group (see the article on the CKM matrix in Patrignani, 2016), 


Vaa Vus Vub 1 0 0 C13 0 siget? C2 s2 0 
V= | Vea Ves Veo} = ]0 c23 s23 0 1 0 —sı2 &2 0 
Via Vis Veo 0 —s23 C23 se 0 c3 0 0-1 
C12C13 $12C13 s137"? 
= | —s12c93 — c12803813e" C12C23 — $12593813e" $23C13 |, (8.214) 
512523 — €12003813€ —C€42823 — 812C23813e"° €23C13 


where c,; = cos6;; and s;j = sin 0;j. The mixing angles are 012, 013, and 093, and 6 is the 
C'P-violating phase. Assuming unitarity, the magnitudes of the elements of V} are 


0.9743 0.225 0.0036 
\Vijl~ | 0.225 0.974 0.041 |, (8.215) 
0.0088 0.040 0.9992 


which implies s12 ~ sin ĝe ~ 0.225, while s13 < s23 < 512. As we will see, the observed 
smallness of CP violation is not because 6 is small, but rather because observable violations 
require that all three families contribute to the relevant transition amplitude, and therefore 
are suppressed by small mixing angles. It is often convenient to employ the approximate 
Wolfenstein parametrization (Wolfenstein, 1983) 


1— d?/2 À AX (p — in) 
V= —À W972 AX? + O(A*), (8.216) 
Ad3(1 — p — in) —A)? 1 


where A ~ sin ĝe. The powers of À incorporate the suggestive hierarchical pattern in (8.66) 
on page 273, while A ~ 0.811 (from Væ), p, and 7 are real and of O(1). The displayed terms 
are unitary through O(A). The higher-order corrections ensure unitarity to higher order, 
but are not important in practice. CP violation is associated with n, i.e., tand = n/p. There 
are various possible conventions for the CP phase in V}. However, the quantity 


(8.217) 


is independent of the convention. From (8.216) 
p=p(l-X/2),  F=n(l—2/2), (8.218) 


up to corrections of O(A“). V} takes the same form as (8.216) when written in terms of p 
and 7 to that order. It is also convenient to define the Jarlskog invariant 


J = Sm (Vus Veb Vag Ven) ~ €12€23C}3812823513 sind ~ APG, (8.219) 
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which is a convention-independent measure of CP violation. 

Measurements of the elements of V} are important to test the consistency of the SM 
and determine its parameters, which are relevant for other quantitites and processes such 
as electric dipole moments or the CKM contribution in models of baryogenesis. It is espe- 
cially important to verify that V} really is unitary, i.e., Vavi = Vİ V = I. A violation or 
apparent violation would signal the presence of new physics. This could take the form of 
a fourth family or heavy quarks with exotic SU(2) assignments, so that the 3 x 3 CKM 
submatrix would not by itself be unitary. Alternatively, it could involve new interactions like 
supersymmetry, leptoquarks, compositeness, or a heavy W’ (e.g., coupling to right-handed 
currents) or Z’ gauge boson that were not properly included in the analysis and therefore 
led to an incorrect determination of some of the elements. Unitarity studies include tests of 
weak universality (the diagonal elements), especially 


(VaV) 41 = [Vual? + |Vus|? + [Vas]? = 1, (8.220) 
as well as unitarity triangle tests (off-diagonal elements), such as 


(VÄ Va)a; = Vab Vua + Vå Vea + VaVia = 0. (8.221) 


Magnitudes of the CKM Matrix Elements and Weak Universality 


We first briefly survey the magnitudes of the CKM matrix elements. For more detail see, 
e.g., (Antonelli et al., 2010b; Porter, 2016) and the articles on the CKM matrix, on Vyas, 
and on Va c» in (Patrignani, 2016) (from which we take most of the numerical results). 
Lattice calculations of decay constants and other relevant form factor parameters, which 
are now very accurate, are reviewed in (Rosner et al., 2015; Aoki et al., 2017). 


|Vua|: As discussed in Section 7.2.5 the most precise determination is from superallowed 
0+ —+ 0* 8 decay, which only involves the vector current. The Ademollo-Gatto theo- 
rem ensures that the theoretical uncertainties are of second order in isospin breaking, 
e.g., Za’, though these must be taken into account, allowing a clean extraction of 
Gr|Vua|. Taking Gr from muon decay yields |V,,q| = 0.97417(21). The neutron life- 
time yields an independent determination of G3|Vial?[1 + 3(ga/gv)*], but the pre- 
cision of the extracted |V,,a| is limited by the measurements of the lifetime and of 
ga/gv from the decay asymmetries. Pion beta decay, 7+ — m°ety,, also yields a 
theoretically clean measurement (Section 7.2.3), but with a larger uncertainty. The 
value extracted from the 7 > uv rate has a relatively large error dominated by the 
theoretical uncertainty in fr. 


[Vus 


: The Ką decays also involve only the vector current, implying by the Ademollo- 
Gatto theorem that corrections to the form factor f+(0) are second order in SU(3)- 
breaking. Nevertheless, there are still theoretical uncertainties in the determination of 
|Vus|, as described in the cited reviews and in Section 7.2.3. The experimental value 
f+(0)|Vus| = 0.2165(4) (with f,(0) normalized to unity in the SU (3) limit), combined 
with the lattice value f, (0) = 0.9677(37), yields |V,,,| = 0.2237(9). A comparable re- 
sult can be obtained from the ratio of K,,2 and 7,2 decay rates, using lattice estimates 
of the decay constants. fx / fz = 1.1928(26) leads to |V.,.| = 0.2254(8), which averages 
with the Kgs result to give |Vus| = 0.2248(6). Other determinations involving hyperon 
and hadronic 7 decays each involve their own theoretical uncertainties, as detailed in 
Chapter 7, but yield reasonably consistent results. 
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\Veal, |Ves|: [Vea] can be obtained from semi-leptonic (D —> rév) and leptonic (D > £v) 


decays, using lattice calculations of the form factor (at Q? = 0) and decay constant, 
leading to an average |Vea| = 0.218(5). |Vea| = 0.230(11) can also be obtained from 
the deep-inelastic processes v(v)N — ptX + c(é) as described below (8.109), im- 
plying a combined value 0.220(5). Similarly, semi-leptonic (D > Kév) and leptonic 
(Ds > v) decays imply |V.,| = 0.995(16). An indirect determination from the 
four LEP 2 experiments utilized the leptonic branching ratios W — ¿v measured in 
e-e+ > W-Wt — ff ff. From (8.155)-(8.157), 


as (May 2 1 
n=d,s,b 


where Be, is the average of the £ = e, u, and T branching ratios. Assuming lepton 
universality, this implies (Patrignani, 2016, LEPEWWG website), 


XC [Vma]? = 2.002(27), (8.223) 
consistent with the expectation of 2 from CKM universality. Using the measured 


values for the other elements, one obtains |Ves| = 0.976(14). Deep-inelastic neutrino 
scattering only determines $|V..|7, where S is the s quark momentum fraction. 


|Vubl, |V|: The values |V,,,| = 0.00409(39) and |V.,| = 0.0405(15) are obtained from in- 


clusive and exclusive semi-leptonic B decays. As described in Section 7.2.7 there is 
tension between the exclusive and inclusive determinations. There are also somewhat 
less precise measurements of |V,,,| from B —> rv and of |V,,,|/|V.5| from semi-leptonic 
Ay = udb decays. 


Vials |Visl, Viol: [Veal and |Vi;| are mainly constrained by the mass differences Amg,, ob- 


tained from B° — B° and B? — B® mixing, which are driven by box diagrams involv- 
ing two W’s, similar to Figure 7.3 (Section 8.6.4). Assuming |Væ| ~ 1, one obtains 
|\Via| = 0.0082(6) and |V;,| = 0.0400(27), with the uncertainties dominated by the un- 
quenched lattice calculations used to obtain the matrix elements. These uncertainties 
are reduced in the ratio Amg/Am,g, implying the more precise |Viq/V;s| = 0.216(11). 
Other constraints from radiative decays such as B > X,7 or Bs > u* u” are described 
in the reviews. |V| is measured directly in single top production at the Tevatron and 
LHC, either through the exchange of a virtual W in the s or t channel in qq or qq 
scattering, or by the associated production Gb — tW. The CDF, D0, ATLAS, and 
CMS results combine to give |V;_| = 1.009(31). 


Combining these results one obtains 


(Vaal? + |Vusl? + |Vasl? = 0.9996(5), (8.224) 
a s/-/ ~ 
B Ke3 negligible 


in impressive agreement with weak universality.” This is especially remarkable because it 


38For many years there was an apparent 2-30 deviation, which was often (wrongly) assumed to be due 
to some problem with the radiative corrections to superallowed 8 decays. The problem was resolved by a 
new generation of high precision K3 measurements that yielded considerably higher values for f+(0)|Vus| 
than the earlier ones (which may not have treated the radiative corrections correctly). For discussions, 
see (Czarnecki et al., 2004; Antonelli et al., 2010a; Patrignani, 2016). 
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confirms the theory at the loop level: without radiative corrections applied to u, 6, and K 
decays the sum would have been ~ 1.04, and those corrections are only finite and meaningful 
in the full SU(2) x U(1) gauge theory. The agreement also strongly constrains such new 
physics as a heavy Wp coupling to right-handed currents, W — WR mixing, and mixing 
between ordinary and heavy fermions. 

Similarly, the universality sums for the second row and for the first two columns are 


XO |Ven|? = 1.040(32) [1.002(27)], 


ane (8.225) 
XO [Vinal = 0.9975(22), S| Var = 1.042(32), 
m=u,c,t m=u,c,t 


where the first [second] value for 5>,,|Ven|? utilizes D decays and the LEP 2 result in 
(8.223), respectively. Since all of these results are consistent with universality, one can fit to 
the CKM parameters (i.e., assume unitarity) to obtain the values in (8.215). In particular, 
this yields |V| = 0.99915(5), which is much more precise than the direct measurement from 
single top production. 


8.6.2 CP Violation and the Unitarity Triangle 


Universality tests whether the diagonal elements of ViVa and ViVi are equal to unity. 
The unitarity triangle tests involve whether the off-diagonal elements, such as (V/'Vq),, 
in (8.221), vanish. The name derives from the fact that each term in the expression for 
(ViVa) a1 is a complex number, which can be thought of as a vector in the complex plane. 
Unitarity implies that they should sum to zero, i.e., form a triangle. 

There are six unitarity triangle tests, but we will concentrate on (Vivi) aia which is 
illustrated in Figure 8.41. The goal is to overconstrain the triangle by redundant tests, to 
check whether unitarity really is satisfied and to determine the parameters. In particular, 
the lengths of the sides can be determined from CP conserving rates, such as the b > u 
decay rate or B? — B° mixing, as described above, while the angles can be determined 
independently by CP-violating effects. Any deviation would signal the presence of new 
physics, such as new fermions, new sources of C'P-violation (e.g., loops involving supersym- 
metric particles), or new flavor changing neutral or charged current interactions (e.g., from 
extended Higgs sectors, a family-non-universal heavy Z’, or a heavy WẸ ). 


Vib Vua 


Ve Ved (0, 0) a1 (1, 0) 


Figure 8.41 Left: the unitarity triangle for (V,'V, = 0. Right: the triangle scaled 
8 q 4/31 


by Vä Vea, so that the vertices are at (0,0), (1,0), and (J, 7). 
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In the Wolfenstein parametrization, 
(VÀ Va) = Veb Vua + Vå Vea + Vå Via 
~ (1 — *) AX (p + in) — Ad? + Ad} (1 — p — in) (8.226) 
~ AX’ (p + if) — AX? + AX (1 — p — if) = 0. 
[Vš Vea| ~ AA? is known accurately, so it can be divided out, 


(VIV) S 


D+ iñ =] 1-5- iñ. 22 
D3 p+ ii +1-p-iñ (8.227) 


Vi Vua/AMB VåVea/AX V Via/AA3 


The rescaled unitarity triangle is therefore the sum of the vectors p+i7, —1, and 1— p — iñ. 
The sides are of length \/p2 +72, 1, and \/(1 — p)? + 72, respectively, while the angles’? 


a, 0, and y defined in Figure 8.41 satisfy 
Vå Vea 
Vip Vea 


VŠ, Vu Við Vts 
y= arg (- ae) ~ — arg Vub ~ 6, bs = arg (i) mS arg(—Vis) 


Væ Vta 
V Vua 


B = arg (- ) ~ — arg Vig, a = arg (- ) ~ arg(—ViaVu) 


(8.228) 


y is essentially the same as the CKM phase ô in the parametrization in (8.214), while 6, is 
the analog of 6 for CALA Pe The Jarlskog invariant J in (8.219) is twice the area of the 
original (unrescaled) version of the triangle. In fact, all six unitarity triangles can be shown 
to have the same area. 

The consistency of this picture can be tested by determining the points (f,7) by redun- 
dant means to see whether they agree. One constraint follows from 


~ yp +7? ~ 0.45(5), (8.229) 


which yields the circular annulus centered at the origin marked |V,,,| in Figure 8.42. To 
proceed we need further inputs from the K and B systems. 


Vib Vud 
Vö Vea 


8.6.3 The Neutral Kaon System 


The neutral K? — K? system has been extremely important in particle physics (for reviews, 
see, e.g., Commins and Bucksbaum, 1983; Winstein and Wolfenstein, 1993; Kleinknecht, 
2003; Cirigliano et al., 2012; Schubert, 2015; Patrignani, 2016). The magnitude of the mixing 
between the K° and K°, induced by second-order weak effects, led to the prediction of the 
charm quark and of its mass, and has been a stringent constraint on new sources of flavor- 
violating physics at the tree and loop level. CP violation was first observed and studied 
in the neutral kaon system, and provides even more stringent limits on some kinds of new 
physics. 

The states K? and K? defined in Section 3.2.3 are the isospin partners of the K+ and 
K—, respectively, and carry strangeness +1 and —1. In the quark model, 


|K°) = |ds), |K°) = |sd). (8.230) 


39The alternative notation ¢; = 8, ¢2 = a, and ¢3 = y is used frequently, especially by the Belle 
collaboration. 
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Figure 8.42 The unitarity triangle, showing the consistency of various C'P-conserving 
and violating observables from the K and B systems. Plot courtesy of the CKMfitter 
group (Charles et al., 2005, http: //ckmfitter.in2p3.fr), with kind permission 
of The European Physical Journal (EPJ). 


Under CP 7 7 
CP |E*)="x|K*),.. “CP |K?) = nġlK?°), (8.231) 


where ņg is a phase. As discussed in Section 2.10 one can always perform a field redefinition 
on the kaons (or on the quarks) to change nx; we use this freedom“? to choose ng = —1. 
The strong interactions conserve strangeness and are CP invariant, so K? and K? are the 
relevant states for describing strong interaction transitions. For example, 7p — KA is 
allowed, where |A) = |sdu) is the S = —1 isoscalar hyperon, while t~p — K°A is forbidden. 
The two states are degenerate by CPT with mass mgo = 497.6 MeV. This is 3.9 MeV larger 
than m+. Similar to the n — p mass difference, this is of opposite sign from the expected 
electromagnetic contribution, and is due to the quark mass difference mg > my. 

However, the weak charged current interactions can violate strangeness and lead to 
K? — K? mixing at second order (in Gr). This implies that the K? — K? system is described 


by a 2 x 2 mass matrix, 
ih ws ee a (8.232) 
KK Ke 


where Mgo = Mgo (~ mgo) by CPT invariance.^! Mpg is the weak mixing term, gen- 
erated by the diagrams in Figure 8.43. It is tiny, but its effects are important due to the 
degeneracy of the diagonal terms. 

If we first ignore the third family then Mj, is real (i.e., CP is conserved), and the 


40Other frequently used conventions are ng = +1, and the Wu-Yang convention (Wu and Yang, 1964), 
which makes the amplitude Ao defined in (8.283) real. The latter is somewhat awkward in the standard 
model. The ng = —1 convention follows from (8.230) if we define the d and s quarks to have the same 
intrinsic C and P phases (Problem 8.26). 

41See (Commins and Bucksbaum, 1983; Sozzi, 2008; Schubert, 2015) for a general discussion allowing for 
the possibility of CPT violation. 
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Figure 8.43 Second-order diagrams leading to K°— K? mixing in the standard model. 


eigenstates of M are the maximal admixtures 


K? z K? 
V2 


The expression for K fo will continue to hold as a definition even after C P-violation is turned 
on, while Ks,z (S and L stand for “short” and “long,” respectively) will represent the mass 
eigenstates in the more general case. K? and KÌ are CP eigenstates, with eigenvalues +1 
and —1, respectively. The states |r+~) and |7°7°) are also CP eigenstates, 


K? = ~ Ks. (8.233) 


CP\xtn7) = +rr), CP|n®r?) = (—1)"|n"r"). (8.234) 


But L is even for |n?r?) by Bose statistics, so CP conservation allows K}? = Ks —> 27 but 
forbids K} = K; —> 2r. Similarly, 


CP\nxta- r?) = (-1)8"|nta- n), CP|n?r?r?) = —|r? rn), (8.235) 


where Isr = 0, 1,2,3 is the total isospin of the rtr ~r? system and we have assumed that 
the total angular momentum is zero. [3, = 0 or 2 can only occur when there are internal 
orbital angular momenta for the 37 system, leading to a strong centrifugal suppression. 
Therefore, Kz — 37 is allowed, while Ks — 37 is strongly suppressed. Since the phase 
space for the 27 decay modes is much larger than for 37, one expects a much shorter lifetime 
for Kg than for Kz, motivating the terminology. This picture is indeed approximately 
valid: The lifetimes Tx, and Tg, are, respectively, 0.8954(4) x 10719 s ~ (2.7 cm )~! and 
5.12(2) x 1078 s ~ (15.3 m )~?, with their non-leptonic decays almost exclusively 27 and 
3m. The Ke3 modes are competitive for Kz, but are only of O(10~3) for Ks, as can be seen 
in Table 8.7. However, the assumption that CP is absolutely conserved was invalidated by 
the observation of K, — 27 decays at the 107? level (Christenson et al., 1964). We will 
return to the subject after discussing the calculation of K? — K? mixing in the standard 
model. 


Calculation of Amg 
The mass difference between the eigenstates in (8.233) is 
Amg = MK, — MK, = 2M. (8.236) 


We will see below that it is predicted to be 


2 
Amr ~ SE m |Val Vas fem Br, (8.237) 


up to third family and QCD corrections, where fx ~ 1.2f, ~ 160 MeV is the kaon decay 
constant defined in (7.78), and Bg = O(1) is associated with the matrix element of the 
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TABLE 8.7 Principal branching ratios for the neutral K decays. 


Ks > n?n? (Kor) 30.7% 
> ntn (Kar) 69.2% 
> nte*ve(De) (Kes) 7.0 x 1074 
Kr > nte®ve(De) (Ke3) 40.6% 
> nF u” vaD) (Kus) 27.0% 
> ntron? (Kaz) 12.5% 
ga? Ea 19.5% 
> ntr 2.0 x 1073 CP-violating 
> 77 8.6 x 10-4 CP-violating 


"The charged K branching ratios are given in Table 7.2. 


effective operator corresponding to Figure 8.43. Amg is much too small to be measured 
kinematically, but it can be determined indirectly by the regeneration technique described 
below to have the extremely small value 3.484(6) x 107° eV. Using (8.237) with Bx = 1 one 
obtains the prediction me ~ 1.5 GeV (Gaillard and Lee, 1974b), in reasonable agreement 
with the value in (5.124) on page 194. In fact, the QCD corrections are non-negligible and 
there is still a non-trivial uncertainty from Bg, but it is clear that the SM prediction is 
qualitatively correct. Without the GIM mechanism (Glashow et al., 1970) the box diagram 
contribution would have been orders of magnitude larger, and there would also have been 
a tree contribution from the flavor changing 5dZ vertices in (8.78). Assuming that no new 
physics contributions to Amg can be much larger than the experimental value also signif- 
icantly constrains new tree-level physics leading to flavor changing couplings and certain 
types of new box diagram effects, as will be further considered in Section 8.6.6. 

Now, let us derive (8.237). Box diagrams similar to Figure 8.43 will lead to an effective 
|AS| = 2 operator 


AS|=2 a = 
fee = CIAS| 2 dy” (1 T 5)s dya (1 _ ~°)s Ee (8.238) 
We will work in the ’t Hooft-Feynman gauge, which means that we must also include the 


box diagrams in which one or both of the W’s are replaced by the Goldstone bosons w* 
as described in Section 4.4. We start with the schematic expression 


bi 


A E he ae 
icli = gT (CO), (8.239) 


where £ includes both the WCC couplings in (8.55) and the Yukawa couplings in (8.91): 


(8.240) 


= My My 
+ d Ai +)| uw +u Mva -7°)| aut}. 


We have neglected ma,, (the b terms are irrelevant) and have used V2/v = g/V2My. 
Expanding £^ and contracting the appropriate pairs of WWT, wtw7, and uniin fields, 
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using the W and w propagators in (4.68) and (4.63), and neglecting external momenta, 


4 4 2 
|AS|=2__ 5, (_9 ee f d*k 1 1 1 
Leff ' (5) 3 AmAn (27)* (k2? — MZ, k? — m2, k? — m2 


m,n=u,c,t 


x [dy K (1 — 7°) djy Kiull — 7°)8 — 2tmtnMyydy"(1 — 7°) dy,(1—°)s 


+2m2nd (1 —y°)s d K(1—°)s] + h.c., (8.241) 
where > 
My 
In = Mz,’ n=u,c,t (8.242) 
and 
An = av XO M=. (8.243) 
n=u,c,t 


The three terms are associated with the WW, Ww, and ww diagrams, respectively. In 
the first and last terms the ¢ parts of the numerators of the fermion propagators survive, 
while in the Ww term the fermion mass term survives. The coefficient reflects the fact that 
there are 4!/2 equivalent terms for the WW and ww diagrams, 4! for the Ww, and that 
(1 — 7°)? = 2(1 — 7°). One can easily show that 


dyt iy" (1 —7°)8 dw Ky (1 — 7°)s = 4d K(1—7°)s d K(1—7°)s (8.244) 


by using the Fierz identities in (2.215) or by using the identity in Problem 2.11. Furthermore, 
one can replace kpko by gpok?/4, since there is no other four-vector in the dtk integral. 
Finally, one can employ the Wick rotation and angular factor as in (E.15) to obtain 


Š G2 M2 
clasi=2 — _SrMw A“ AS FM 8.245 
pe mae (8.245) 
where 
FM = (1 i Tmn) B?” 4 28min BP” 
g% eis (8.246) 
Bg” = ; 
i J (2 + 1)?(2 + £m)(2 + £n) 
The integrals are elementary, and yield 
Aklim) — AklEn 
Bpr — ArEm) — Arlen) (8.247) 
Lm — Tn 
with 21 i 
1 x Ine 1 xine 
A = A = i 8.248 
(2) Le Ua (2) l-g (1-2) ( ) 
To verify (8.237), let us neglect the mixing with the third family, so that 
Au = —Ac = cos ĝe sin be, (8.249) 
and Gum 
clAsi=2 = — =F W X [FM 4 Fe — QF, (8.250) 


167? 
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£e is very small (~ 3.5 x 1074 for me ~ 1.5 GeV), and z, is negligible for either the current 
or dynamical mass. To leading order in ze, 


rw I, F! w 1+ 3z£e + 2zeln £e, F” m 1 + £e + zeln ze, (8.251) 


so the quantity“? in square brackets in (8.250) is £e. 
Now consider the matrix element 
(K0|clAS|=2) Koy 


Mpk = . (8.252) 


The 1/(2mxg) follows from our covariant state normalization, noting that 
(B"H |p) = Ep(B' IB) = (27)°5°(B" — B)(B"|H(0) |p), (8.253) 


and using (2.3) and translation invariance (1.15). To estimate the matrix element, we will 
use the vacuum saturation approximation, in which one inserts the vacuum state between 
the ds operators in all possible ways, i.e., 


(K°|dy"(L— 7°)s dyu (l — 7°)s|K°) vac 
=2(K°|d*y#(1 — 7°) 520) (0d (1 — 7°) 85|K°) (8.254) 

+ 2(K°|d*4"(1 — 7°)sg|0) (Old? y#(1 — 7°) sal K), 
where we have reintroduced the color indices œ and 8 to keep track of them: the second 
term, which corresponds to the second diagram in Figure 8.43, is obtained by performing 
a Fierz transformation before inserting the vacuum. The 2 comes from interchanging the 
two ds factors. The matrix elements of the vector currents vanish by reflection invariance, 
and the axial matrix elements are given in the SU(3) limit by (7.76) on page 241, where 
dypys = Ap + iAT, = (8y,7°d)'. Our phase choice ng = —1 in (8.231) corresponds to 

1 

V2 


1 


356 — iT), (8.255) 


|K°)=—s|6 +17),  |K°) = 
implying 7 g 

(0d (1 — 7°) sa|K°()) = (013P (1 — 7°) da] K? (4)) = tifa", (8.256) 
with f = fr. In fact, SU(3) breaking is important, but from isospin one has that f should 
be identified with the fx from K- —> pp, in (7.78). Similarly, 


_ 2 ey 
(O|d*y"(1 — 7°)sg|K°(q)) = -i ifgq", (8.257) 
so that : 
(K°|dy"(1— 7°) dy —7)8|K° vac = 3 fe Mix: (8.258) 


Vacuum saturation is not expected to be a good approximation, but is useful as an order 
of magnitude estimate. It is therefore conventional to define a factor Bx to describe the 
departure from vacuum saturation, i.e, 


- - o8 
(K°\dy#(1—-°)s aL 7°)s|K°) = gfkMKBK. (8.259) 


“Without the c quark, the corresponding factor would be F““ ~ 1, about 3000 times larger, as stated 
below (8.237). 

43Qlder references often quote a value 3/2 larger. This was based on a pre-QCD calculation involving 
only one color of quark. 
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Combining (8.236), (8.250), (8.252), and (8.258), one reproduces (8.237). 
QCD corrections (e.g., Buchalla et al., 1996) to C!45!=?, which also involve additional 
penguin diagrams, are significant. Including the t quark, 


2 GEM is P ays 
ClAsl=2 — Ts [A277 So (xe) + A% N2So(xz) + 2A* A350 (ae, ae) | F(u), (8.260) 
where 71.2.3 = 1.87(76), 0.5765(65), 0.496(47) are short distance QCD corrections,** de- 
fined to be independent of the renormalization scale u, and F(u) contains scale, scheme- 
dependent, and higher-order factors. Bx, defined in (8.259), also becomes p-dependent, 
but 

Br = Bx(u)F(u) (8.261) 


is an observable that is scale independent. Bx may involve both short distance and long 
distance contributions, where the latter are associated with virtual hadrons rather than 
quarks. Lattice calculations (Aoki et al., 2017) yield Bx = 0.763(10). The So functions in 
(8.260) are obtained from (8.245) after eliminating A, = —(A- + Az). After a somewhat 
tedious calculation, one finds 


Solze) = 2.48 x 1074 


Ar, — 112? + 23 3a3 In 2; 
= N 2, 
Solz) 4(1 — x)? 2(1 — x4)? 33 (8.262) 
Tt 324 3a? In z 


~ 2.20 x 1073 


Solte, 2t) = £e |l 
etd men aa a) a — 2, 


to leading order in x, (Inami and Lim, 1981). The numerical values are for the MS masses 
Mele) = 1.27 GeV and fulu) = 164.1 GeV (Patrignani, 2016). Combining the various 
results one predicts Amg ~ 3.08 x 107° eV, close to the experimental value 3.484(6) x 107° 
eV. The difference is presumably due to (difficult to estimate) long distance contributions 
not described by the box diagrams in Figure 8.43. 

Complex phases in the X,,’s lead to CP violation. This requires not only the existence 
of three families, but that all three are relevant to the process. Otherwise, the factors could 
be made real by a redefinition of the fields. 


Ks, Decays, Oscillations, and Regeneration 


We saw following (8.235) that, assuming CP conservation, Ks decays rapidly to 27, while 
Kz — 3r and to semi-leptonic modes with a lifetime nearly three orders of magnitude 
longer. However, in 1964 the decays Kz — 27 were observed, suggesting a tiny violation 
of CP. To see how this was done, let us consider an idealized experiment in which one 
starts with a pure K? beam, which could be produced for example by the strong interaction 
process 7p + K°A in a target. In particular, consider a single particle state |Y(T)} that 
is initially a K? with definite velocity £, 


1 
v2 


44The quoted values include NNLO or NLO corrections (Brod and Gorbahn, 2012, and references therein). 


1y(0)) = |K°) = —. (IKs) + |K1)). (8.263) 
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If one could ignore the decays, then at a later proper time T = d/8By, where d is the distance 
travelled, the state would have evolved to 


-1 
-5 
= [cos (231) |K?) — isin (231) \K°)| eM KOT 


That is, there is a probability sin? Amc that the K? will have oscillated into a K°. This 
is of course an example of the two-state problem familiar in quantum mechanics, which is 
itself reminiscent of the classical coupled oscillator system. We will encounter it again for 
B? — B® and neutrino oscillations. However, in the neutral kaon system the oscillation time 
2r/AmxK ~ 1.2 x 107° s is long compared to Tg, so we must take the decays into account. 
This can be done by replacing (8.264) by 


WE) = 5 [lKs)e T + [Ke | 


(8.264) 


-TET 


1 E 
Baja So 4 Kye mete |. (8.265) 


V2 


The factors exp(—I's,r7/2) represent the depletion of the beam into the 27 and 37 (or 
semi-leptonic) channels, respectively, with rates T's, = 1/Tks,„. (These factors could be 
incorporated in a more elegant manner by considering the multi-channel system including 
the decay states or by introducing a density matrix, but the simple exponential factors are 
adequate for our purposes.) Clearly, one observes 27 decays near the source from the Ks 
component. Far away, for Tks < T < TK,, one has an essentially pure Kz beam and one 
expects to observe only 37 and semi-leptonic decays. 

Before turning to CP violation, let us consider the kaon regeneration technique that can 
be used to measure Am x. The basic idea is familiar from measurement theory in quantum 
mechanics, or from the use of polarizers in optics. Suppose one places a piece of matter, 
known as a regenerator, a distance dı downstream from the source. If the corresponding 7; 
is much larger than Tg, the beam entering the regenerator is a pure Kz, i.e., 


I(r) = = |lKs)e e 


|) + |K°) 
Tn) = |K) = ————_ .,, 8.266 
Y(T) = |Kx) Ji (8.266) 
where we have renormalized the coefficient to unity. The two components can scatter or be 
absorbed in the regenerator with different amplitudes, so the state emerging on the other 
side at T2 = Tı + € is 


I(72)) = a|K®) + BLK) = < Kx), (8.267) 
where we can again renormalize so that |a|? + |b|? = 1. For a 4 b the Ks component has 
been regenerated. One typically expects |b| < |a| since the K? component can be strongly 
absorbed by K°p — mA in the regenerator, while there is no analogous reaction for K°. 
In the extreme case of b = 0, a pure K? emerges. 

Now, suppose the regenerator is placed close to the original source, so that |¢(7,)) has 
a non-negligible Ks component. The Ks, components can then interfere with each other 
so that the (unnormalized) state emerging from the regenerator is 


b(72)) = [(a + b)A + (a — b)] |Ks) + [(a — b)A + (a + b)] |x), (8.268) 


where A = exp(tAmx72 — I sT2/2), we have removed a common phase, have approximated 
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exp(—I',72/2) ~ 1, and have neglected the thickness of the regenerator. The intensity of 
Ks in the regenerated beam can be measured by observing the 27 decays, with a rate 
proportional to |(Ks|w(72))|?. For b ~ 0 this is 


(Kg |th(72)) |2 x E + e7Tst 4 2e7TsT/2 cos (Amx T2)| . (8.269) 


Amg can then be determined by varying dı ~ də. In practice, regeneration experiments 
usually involve two regenerators, allowing Amg and CP-violating parameters to be deter- 
mined by varying the distance between them or utilizing the energy spread of the beam. 


CP Violation in K decays 


In 1964, Fitch, Cronin, and collaborators observed 27 decays in a neutral kaon beam pro- 
duced at the Brookhaven AGS some 300 Kg decay lengths from the source, implying the 
CP-violating Kr, + 27 decays at the 2x 107° level. Subsequent observations utilized the re- 
generation technique, which allowed the measurement of both the magnitudes and phases of 
the CP-violating parameters (as well as Amg). Since regeneration measures interferences, 
it also eliminated possible alternative explanations of the events, such as the emission of an 
unobserved third exotic particle in the decay. 
There are two possible sources of the CP violation: 


CP violation in the K? — K? mixing (indirect violation). This would be induced by 
an imaginary part in Mpg defined in (8.232), so the mass eigenstates Ks,z no longer 
coincide with the CP eigenstates K? in (8.233). 


C P-violation in the decay amplitude (direct violation). 


We now understand that both effects are present, though the indirect C P-violation is much 
larger. They are both understandable in the SM as being due to the CP-violating phase in 
the CKM matrix for three families. This phase can be large, but the observed effects are 
strongly suppressed by small mixing angles. 

Define the CP violating parameters 


= [yle we te 
(8.270) 
= [oole ~ e= 2e', 


where A(Ks, > ntn, n?n?) are the decay amplitudes. The difference between them is 


due to the fact that the 27 state can have isospin 0 or 2. As commented briefly in Section 
7.2, the I = 2 amplitude is much smaller than the J = 0 one (the AJ = 4 rule). € can be 
generated by mixing (indirect) or by direct breaking in the J = 0 amplitude, though the 
former is more important for our phase convention. ¢’ is due to a phase difference between 
the J = 0 and 2 amplitudes; it indicates direct violation but is suppressed by the AI = $ 
rule. 

|74—| and |ņoo| can be measured from the rates for the various 27 decays, while the 
phases y,— and Yoo can be measured by interference effects in regeneration experiments. 
A fit to the results yields (Patrignani, 2016) 


Ino] = 2.220(11) x 1073, Poo = 43.52(5)° 


3 (8.271) 
In}-| = 2.232(11) x 1073, 4- = 43.51(5)°, 
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and the corresponding values 


1 
lel = 2.228(11) x 107%, Re(e'/e) ~ 3 (1 2 | 


) = 1.66(23) x 107°. (8.272) 
N+ - 


The Re(e’ /e) measurement was especially difficult since it involves differences between two 
small effects, but was eventually determined precisely by the KTEV (Fermilab) and NA48 
(CERN) collaborations (Sozzi and Mannelli, 2003). CP breaking is also observed in the 
difference between the K¢3 decay rates, 


_T(Kr >m fy) -T (Kia ated) _ 3.32(6) x 1073 (8.273) 
D) 7 ` : 


ob = I (Ky oa ftv) +T (Kr > rtp 


To interpret these results we need to find the eigenstates |g 7) (in the presence of CP 
violation) of the operator 
Tr o 
H =M -i> Aly) = i— 8.274 
iZ, Hb) =Ê), (8.274) 
which governs the time evolution of the system. M = Mt is the 2 x 2 mass matrix defined 
in (8.232), while r =I‘ is an analogous decay matrix. It is defined as 


Tar =X ppM(a> f) M> f), (8.275) 
f 


where pp are the coefficients and phase space factors for a decay into channel f defined in 
(D.7). In the absence of CP violation the eigenvalues of T are T's,z and (8.265) is the solution 
to (8.274). CPT conservation ensures that the diagonal entries are equal, i.e, Mpo = Mko 
and T o =T go. It is straightforward to find the eigenstates and eigenvalues (e.g., Commins 
and Bucksbaum, 1983), 


_ |K9) +K) 


|Ks) = p|K°) — q|K°) Tee 


(8.276) 
z ZIK?) + |K? 
[krj = p|K®) palt) = EEE, 
L+ |F 
where 
toe [m> rke]? 
q_i-e KK '~3 
= <= |, 8.277 
p 1+ë are (8:277) 


so that nonzero € requires imaginary parts for Mgg and/or I gg. The eigenvalues are 


T T | Ripi Tee 
mst — i i = (tio = 1 a) (Me s) (Mice =i s£), (8.278) 


so that 


Amg =m — ms ~ 2Re Meg, Irs-Irzr ~ -2ReT ex, (8.279) 
neglecting corrections of O(é). |Kz) and |K‘g) are not orthogonal, but satisfy 


2Re€ 


SS 2 č. 2 
(K1,|Ks) T+ jej? Re € (8 80) 
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This is due to the non-Hermitian nature of H in (8.274). From (8.277) and (8.278) one finds 
to O(é) that 


= = e% sin be 281 
: Tg me Ty, =- 2AMK i g Amg 2AMg i (8 i ) 
where 2A 
0. = tan`! LTK  _ 43.52(5)° (8.282) 
T's —-Tr 


is the superweak phase.*° 


Finally, let us define the amplitudes for K° and K? to decay into 27 states of definite 
isospin 7 = 0 or 2 and J3 = 0 by 


(I Iz = 0\Hm|K°) = Aye’ 


(I Is = 0|Hni|K°) = Až, eee) 
where Hn is the Hamiltonian density for non-leptonic transitions defined in (7.21). Ay is 
the weak part of the amplitude. It changes sign for K° because of (8.231) (with ng = —1) 
and (8.234), and the complex conjugation reflects the CP transformation, as described in 
Section 2.10. exp (iôr) represents the S-wave phase shift induced by the strong interaction 
final state interactions of the two pions, and is the same for both K? and K°. Empirically, 
Re A /Re Ao = w ~ 0.045 < 1 (the AJ = 5 rule), and 


oL = ô2 — ôo + 5 = 42.3(1.5)°. (8.284) 


The S-wave m+z~ and 7°r° states are related to the isospin states by 


1 2 1 
— (|rta-) + |xont)) = gi =0 =0)-—|=2 I, =0) 
v2 i v3 (8.285) 
1 2 
|r?r?) = -f4r= 0 Is =0)+ air 2 I3 = 0), 
where we have used the conventions in (3.91). 
Combining (8.270), (8.276), (8.283), and (8.285), one obtains 
=e+ e+e 
TF weil F0) 2 (8.286) 
— 2e r l 
Wae S 
where Smd i Dui Dnd 
a SM AQ 1 io’ | SM Ag SM AQ 
= S e : 8.287 
omeri T ea e (Saer) 


Under the reasonable approximation that the 27 state with J = 0 dominates the decays the 
iSmT gg term in (8.281) becomes 


~N? 
Tg Re Ao , 


(8.288) 


45Tn the superweak model (Wolfenstein, 1964) all CP violation is due to a new superweak interaction 
that only contributes to the mixing. It was finally excluded by the observation of e’. 
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so that 


io = ~% 
a a Sm M Sm Ao 
em~ e'% sin 6. ( iE : 


Amg Re Ao 


Both € and ¢' involve phase differences, which cannot be rotated away by field redefinitions. 
However, the relative importance of the two contributions to € can be changed by choosing 
a different convention for ng in (8.231). For the common conventions the iSm Ao/Re Ao 
contribution is < 5 — 10% of the total and is usually neglected. If one neglects the second 
term in € and approximates the superweak phase 6, by 7/4 one obtains the commonly used 


approximate form 
eit /4 (S) 


(8.289) 


EN 


Pan (8.290) 


e' is predicted to be small because of the w factor. For that reason and because of the 
accidental near equality of 0e and 0! one expects 


Poo ~ P4- = be = 43.51(5)°, (8.291) 


in agreement with (8.271). This is often viewed as a test of CPT invariance, which was crit- 
ical in the derivation. When one uses the SM expressions for the WCC, the Keg asymmetry 
parameter in (8.273) is predicted to be 


Op ~ 2Reé = Wee = 3.23 x 107%, (8.292) 


consistent with the experimental value. 
The SM prediction for € is obtained by combining (8.260) and the related results with 
(8.290) (Buras, 2005), 


= Gr Mw fem 
6/272AmK 
We 


3.63 x 104 


x Gm A (Re Ae [mSo(we) — ns $o(ae,e)] — Re Av n2S0(21)), 


Br ke eit/4 
(8.293) 


where we have used )7,, An = 0 and the conventions for V} in (8.214), and have neglected 
Re Az compared to Re Ac. The factor ke ~ 0.94(2) estimates long-distance and other cor- 
rections (Buras et al., 2010b). The last term dominates. Using the experimental value for e€ 
and expressing the result in terms of the Wolfenstein parametrization, 


nl(1 — p) + 0.27(9)] ~ 0.43(5), (8.294) 


which corresponds approximately to the hyperbola labeled eg in Figure 8.42. The uncer- 
tainties are dominated by the A parameter and the QCD factors. 

e is due to direct CP violation in the decay amplitudes. It is believed to be dominated 
by the penguin diagrams (so-named for their appearance) shown in Figure 8.44. The gluon 
penguin contributes only to Ao in the SU(2) limit, and may be partially responsible for 
the AI = 1/2 rule. However, because of the small value of w, the electroweak (Z and 
y) penguin is also important. There is a cancellation between the diagrams, increasing the 
theoretical uncertainty considerably. Furthermore, there are corrections for isospin breaking, 
and the matrix elements are sensitive to the s quark mass (which is rather uncertain—see 
(5.123)), which enters in some approaches using the PCAC equations in (5.116) and (5.119). 
Estimates have utilized large N,, chiral perturbation theory, lattice, and other techniques for 
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the long distance effects, with results varying considerably, from agreement with experiment 
to significantly smaller values (for recent reviews and discussions, see, e.g., Cirigliano et al., 
2012; Buras et al., 2015b). For this reason, the uncertainties in the SM predictions for 
Re (e’/c) x Sm At x Ñ are too large to usefully constrain the unitarity triangle or to draw 
definitive conclusions about the possible need for new physics. 


=. . Xoo 
S. f u,e, t _ s_ f u,c, t _d 
Baia a te a 
Co 
= G sY 
ae eh r a 
d d d d 


Figure 8.44 The penguin diagrams for |AS| = 1 processes. Left: the gluon penguin, 
which only contributes to Ag. Right: electroweak penguin, which contributes to Ao. 


8.6.4 Mixing and CP Violation in the B System 


Mixing and CP violation in the neutral B system have been extensively studied, especially 
at LEP and the SLC; by CLEO at CESR (Cornell); at the asymmetric B factories BaBar at 
PEP-II (SLAC) and Belle at KEKB (KEK); by CDF and D0 at the Fermilab Tevatron; and 
by LHCb at the LHC (for reviews, see Artuso et al., 2009; Amhis et al., 2014; Patrignani, 
2016).*° The asymmetric B factories (Bevan et al., 2014) employed high intensity e+ beams 
of unequal energy so that, e.g., e~et —> T (4S) > B° B° yields B’ B® pairs that are boosted 
along the beam direction. This allows larger and more easily measured distances between 
the production and decay vertices, and allows the time between the decays to be measured 
by their spatial separation along the boost direction. The rates at the Tevatron and LHC 
are much higher, but are in the more difficult hadronic environment. The higher-energies 
at the hadron machines (and in some later Belle running) allow study of the B? system as 
well as the B°. 

The formalism for B? — B? mixing and CP violation, with i = d,s and BY = B®, is 
similar to that for the neutral kaons (see, e.g., Carter and Sanda, 1981; Buras, 2005; Nir, 
2005; Patrignani, 2016). One important distinction is that in the kaon system there is a very 
large difference between the Kz and Kg lifetimes, because in the absence of CP breaking 
the 27 state is only accessible to the Kg. There is no analog for the neutral B decays 
because of the many decay channels, and one expects similar lifetimes for the two mass 
eigenstates, which are instead labeled By, and By,, where H and L denote “heavy” and 
“light.” Defining AT; = Tz, — Tp, = 2| g, g; |, the SM prediction (Lenz and Nierste, 2011) 
is AVa/T'a = 42(8) x 1074, which is negligible, while AT,/I, ~ 0.13(3) is small. Both are 
consistent with experiment. 


46 Mixing in the neutral D system has been observed by BaBar, Belle, CDF, and LHCb (e.g., Patrignani, 
2016). The results are consistent with the SM expectations, though the latter have significant long distance 
uncertainties. CP violation is expected to be small in the SM since the mixing is dominated by the first 
two families. New physics implications are considered in (Golowich et al., 2007). 
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Similar to (8.230) and (8.231), we define the strong interaction eigenstates 


|Bi) = |B°) = |db), |Bi) = |B°) = |bd) = -C P| B3) 
|B?) = |b), |B?) = |bs) = -C P| B$), (8.295) 
where we have chosen an ng, = —1 phase convention. The B? and B? are mixed*” by box 


diagrams similar to Figure 8.43, but in this case only the t quark exchange is significant 


and the long-distance corrections are expected to be small. The eigenstates are*® 
|Bu.) = pilB?)+a1B?), |Br:) = pilB?)-4:|B?) i= d,s, (8.296) 
where 
M ag 
i 1— Či - B T? > 
a | . (8.297) 


Pi ~ 1+ či Mbp,5, = ja Bibs 
Ignoring I’, 5,, which is much smaller than Mz, 5, for both Bg and Bs, one finds that q;/p; 
is a pure phase and that 


Am; = my, — Mz, = 2|Mz, 2, |, či 


= 8.298 
Am; cos? pm /2 ym small Am; ( ) 


where m is the phase of Mp, g,- Am; can be measured by observing B} — B? oscillations.*® 


Suppose one starts with an initial state tagged as a B}. The tagging is typically done by 
observing the decay of the other meson, e.g., from a B° — B? or B° — B~ pair. Same-side 
tags, utilizing other b-jet fragments, are also possible. In some cases, such as T (45) > B° B? 
(or ¢ > K} K?), the two are produced in an entangled coherent state, but we will ignore 
that complication. Similar to (8.263), 


I(0)) = |B?) = = (Bad + |Bz.)). (8.299) 


At a later proper time 7, 


{T 


1 ; 
\W(r)) ~ A ome” ü |Br, es" e~ i 


Am; Ami fats imen Tit 
= [cos ( 222 |B?) — isin E, (Br eTe 3 ; 
2 2 Pi 


where m; = (my, + mz,)/2 and we have ignored the lifetime difference. Am; can then 
be obtained by observing the proper time dependence for B? and/or B? decays, e.g., by 
b + cl D (from B?) or b > tve (from B°). The current values are Ama = 0.510(3) 
ps + and Am, = 17.757(21) ps~!. The latter was especially difficult to measure accurately 
(initially by CDF and DO and subsequently by LHCb) because of the much shorter oscillation 
time 27/Amg. 

The standard model calculation from the box diagram yields 


(8.300) 


G2 M2 ^ 
Am = E npm yo Solei) VA Val? Bo, fh, (8.301) 


47 B0 — B? and B° — B? mixing are negligible because the diagonal terms are not degenerate. 

48Some authors reverse the sign of q; in (8.296). 

49Farly studies utilized time-integrated effects, such as the relative number of same-sign and opposite-sign 
lepton pairs resulting from associated B° — B° production. 
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where mpo ~ 5.28 GeV, mgo ~ 5.37 GeV, ng ~ 0.55(1) is the short distance QCD cor- 
rection (Buras, 2005), and So(2;) is defined in (8.262). Bg, and fg, are analogous to Bx 
and fx. From unquenched lattice QCD calculations (Aoki et al., 2017), fB, Be ~ 270(16) 


MeV and fa, By. a2 219(14) MeV. Some of the theoretical uncertainties cancel in the ratio 
€ = 1.24(5). The values for |V;al, |V:s|, and the ratio quoted in Section 8.6.1 utilized (8.301) 
with Vip ~ 1. Alternatively, one can use the Am; as a constraint on the unitarity triangle. 
In the Wolfenstein parametrization 


[Vä Væl? = A208[(1—p)2 +9], ViVa]? = A24, (8.302) 


so that Ama yields the circular annulus centered at (f,7) = (1,0) shown in Figure 8.42. 
The uncertainty is dominated by B Ba fB,> so one can obtain a tighter constraint from the 
ratio Ama/Amsg, also shown in Figure 8.42. The |V,,5| and Amg/Am, constraints intersect 
at (9,7) ~ (0.17, 0.36), with large uncertainties, and the ex hyperbola is consistent. 


C P Asymmetries 


Many C P-violating asymmetries have been measured or searched for in the neutral (and 
charged) B system, which have established indirect and direct CP violation and strongly 
supported the validity of the unitarity triangle predictions (for reviews, see Buras, 2005; 
Nir, 2005; Amhis et al., 2014; Porter, 2016; Gershon and Gligorov, 2017; Patrignani, 2016). 
We will illustrate with one especially clean and important case, i.e., the time-dependent 
asymmetry 

IN Cl a > f)-T(B°(7) > f) 
r(B?(T) > f) +T(B?(T) > f) 


(A similar formalism applies to B? asymmetries.) |B°(r)) and |B°(r)) are defined as 
the states that were initially tagged as B° or B°, and f is a C’P-eigenstate, such as 
J/WKs, ntr, or pKs. One observes asymmetries associated with the interference be- 
tween decays with and without mixing, and in some cases asymmetries associated with the 
decays themselves. Similar to (8.300), 


as(T) = (8.303) 


|B°(r)) = |B°) f(r) + B°) $- (7) 


(8.304) 
|B?) = |B) (r)E + 1B) f0), 
where 
fi (7) = cos (2 =) -imt e- F 
8.305 
f(r) = isin (SRT) mime, man 
and we have again ignored the lifetime difference. Also, define 
Ay = (flHnilB°), As = (f|Hni|B°) (8.306) 


in analogy with (8.283). Ay and Ay may be written 


Ap= 5 Haje, Apc > |Aale=e-*2, (8.307) 


a 
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where the sum is over the diagrams a that contribute to the decay, 6, is the strong final 
state interaction phase associated with Aq, Ya is the weak CP- phase generated (in the SM) 
by the phases in the CKM matrix, and nf = +1 is the CP parity of f, i.e., CP|f) = n,\|f). 
The decay amplitudes at proper time 7 are then 


(f[HnilB°(r)) = As [f+ (7) — Asf- (r) 


z 8.308 
PHL BO) = AE -O Ar), da 

where a 
Ape — 228, 8.309 
(=A (8.309) 

The decay rates are 
0 _ 2 g o Q : -Irr 

T (B° (T) > f) = |Agl? [ey +c- cos(Amr) — Sm àp sin(Amr)] e (8.310) 


T (B°(r) + f) =|Ag? [c+ — c- cos(Am T) + Sm Ap sin(Amr)] e7", 


where c+ = (1  |[Aș|?)/2 and we have used |p/q| ~ 1. (8.310) leads to the asymmetry 


af(T) = Cr cos(Amr) — S¢sin(Amr), (8.311) 
where Ay Sd 
1—|Af 23m Af 

= —__ = — —_. 8.312 

f= Top F5 EDP ee 


(There are a number of conflicting notations for Cy and Sp and their signs in the literature.) 
Since |p/g| ~ 1, a nonzero Cy implies direct C P-breaking in the decay, i.e., |As/A;| 4 1, 
while Sy is the mixing-induced term. 

Especially simple to interpret are decays for which one diagram dominates (or two 
diagrams have the same weak phase). In that case the strong interaction effects cancel, and 
Az is just 
~M 


Àf = ay e = ne aor (8.313) 


where y is the weak phase and —yy, is the phase of q/p, so that 


Cy =0, Sy = Nf sin(Ym + 2%). (8.314) 
In particular, for the golden modes f = J/YKs, (with nf = £1, respectively), one has 
M* N 1/2 ‘ 
1 ( ba) = VaVid _ 5-16, (8.315) 
Pp MBB Ve Via 
as can be seen from Figure 8.41. From Figure 8.45 the contribution of the weak phase is 
; V Veb 
Tie ee d. 8.316 
A a 


Neglecting small effects associated with T gg, CP breaking in the kaon system, and pen- 
guin diagrams”? one obtains Sy/yx,, = +sin(28). The J/YKs,, asymmetries have been 
measured by BaBar, Belle, and LHCb, leading to sin 26 = 0.691(17). As can be seen in Fig- 
ure 8.42 this agrees well with the region in the p— ñ plane determined by the other methods 
(other observations and the global analysis largely eliminate other branches for 28). 


50The gluon penguin diagrams for B° — J/~Ksg,1 are similar to those for B? — K? in Figure 8.45 except 
the s5 pair is replaced by cc. These are expected to be small, and in any case the dominant contributions 
have the same weak phase as the tree diagram. 
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Figure 8.45 Top: tree-level diagram for B° > J/yKs,, (left) and B} —> ntr- 
(right). Bottom: Gluon penguin diagrams for B° + @K° (left) and B° > ntr 
(right). Electroweak penguins involving Z and y are also possible. 


Other decays, e.g., involving b — Gud, b — ciid, or the charmless b + sqq decays 
such as B? —> ¢Kg or 7°Kg can also be used to determine sin 2. All are consistent with 
the J/YKs,z results. In the SM the charmless decays are driven by the b > sqq penguin 
diagrams shown in Figure 8.45. Since the SM amplitudes are loop-suppressed, they are 
a good place to search for the effects of new physics (Buras et al., 2004; Buchalla et al., 
2005), which could enter in penguin diagrams, such as in supersymmetry (e.g., Artuso et al., 
2008a), or at the tree-level, such as in models with an extended Higgs sector or a Z’ with 
flavor changing couplings (e.g., Barger et al., 2004). 

The direct measurement of the angles a and y is more difficult. For example, the asym- 
metry for B} + rz would yield sin 2a if it were due entirely to the tree-level diagram for 
b — duti shown in Figure 8.45. However, there is also a significant contribution from the 
b —> d penguin diagram, which has a different phase. The tree and penguin effects can be 
sorted out with some effort, using different isospin channels (Gronau and London, 1990) 
for B} + ntr, n?n? and Bt > ntr? decays, as well as isospin, polarization, and Dalitz 
plot information on related B — pz and pp decays, yielding a ~ 88(4)°. From (8.228) 7 
does not depend on Vin. It can therefore be measured in the interference of tree-level decay 
amplitudes, e.g., between B7 > D°9K~ + fK- and B + D°K~ — fK~—, where f is a 
final state accessible to both D? and D°, yielding y = 73(7)°. 

Studies of mixing and CP violation in the B? — B? system have been carried out at the 
LHC and Tevatron, especially by LHCb, with generally good agreement”! with the SM (for 
a general review, including both semi-leptonic and nonleptonic decays, see Artuso et al., 
2016). One complication is that AT, is no longer negligible. It is still a good approximation 
to ignore I gg, in (8.297), but (8.300) must be corrected for Tz, # T y,. Similar to B° + 
J/YKs,L, the decays B? + J/Y K+K- (or B? > J/wd) are dominated in the SM by the 


51There is a 3.60 discrepancy from the SM prediction in a DO measurement (Abazov et al., 2014) of the 


dimuon asymmetry (Netet —N¥ H )/sum in the decays of a combination of B° B? and B} B9 pairs. This 
has not yet been confirmed by other experiments. 
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b — c3 tree-level diagram, with a very small penguin contribution. The time-dependent 
CP-asymmetry (including an angular analysis to separate the CP even and odd parts of the 
J/K* K~ state) determines both AT, and yy, ~ —28s, where yy, is the phase of Mg 5, 
and s is defined in (8.228). The result is (Amhis et al., 2014) AT, = 0.084(7) ps~! and 
m, = —0.030(33) rad, consistent with the SM expectations 0.088(20) ps~! and —0.038(1), 
but with a much larger uncertainty. 


The Wolfenstein Parameters 


The constraints from the magnitudes of the CKM elements; K, D, and B mixing; €, €; 
and CP violation in the B decays are generally consistent with the SM predictions and 
with the unitarity triangle, although there is still room for new physics and even hints of 
discrepancies. A global fit (see the CKM article in Patrignani, 2016) to the CKM parameters 
assuming unitarity yields 


\=0.2251(5), A=0.81(3), p=0.12(2), 7 =0.36(1) (8.317) 


as well as the results in (8.215). The J parameter defined in (8.219) is J = 3.0(2) x 1075. For 
more information on the unitarity triangle, see, e.g., (Charles et al., 2005, 2015; Bona et al., 
2006; Amhis et al., 2014; Porter, 2016), as well as the websites listed in the bibliography. 
Future prospects for the unitarity triangle and related physics studies, especially by LHCb 
and at the future Super B factory Belle II at KEK, are reviewed in (Harnew, 2016). 


8.6.5 Time Reversal Violation and Electric Dipole Moments 


Time reversal and CP violation (Section 2.10) are closely related, in the sense that any 
CPT invariant interaction that violates one must violate the other. We are assuming C'PT 
here, so we will continue to use CP and T interchangeably for most theoretical discussions. 
Nevertheless, CP and T transformations are different, and possible T-violating observables 
open a new window on standard model tests and new physics searches (e.g., Bernabéu and 
Martinez-Vidal, 2015). In particular, the effects of true CP violation are essentially limited 
to flavor changing processes such as K and B decays, while some T-odd observables such 
as electric dipole moments are also relevant for flavor-diagonal channels. 

The only direct observations of T violation to date are a difference in the K° > K? and 
K? > K” oscillation probabilities by the CPLEAR collaboration at CERN in 1998 (e.g., 
Angelopoulos et al., 2003), and in differences between, e.g., B° > By and By —> B° 
oscillations, observed by BaBar in 2012 (Lees et al., 2012). The latter experiment involves 
pairs of quantum-entangled neutral B mesons, one of which decays to (+X (B°) or © X 
(B°), and the other to J/pK? (B,) or J/WK (B_), where By are appropriate linear 
combinations of B° and B°. The CPLEAR and BaBar results are consistent with the CP 
asymmetries and with CPT. In principle, T violation could be searched for in neutrino 
oscillations, e.g., by comparing Vu — Ve and Ve > Vu, but this would be extremely difficult. 

Another possibility is to directly observe T violation in scattering or decay processes. 
However, this is very difficult because T not only reverses spins and three-momenta, but 
also interchanges initial and final states. For example, consider the matrix element (f|Hw |i) 
of the weak Hamiltonian (or relevant effective operator) between an initial state i and final 
state f. Using (2.317) on page 59, 


(f|Hwli) = (THIT HwT Ti)" = (Til HwT ITP), (8.318) 


where Ti and Tf have their momenta and spins reversed compared to i and f and also 
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have the roles of “in” and “out” states interchanged (which is relevant if there are initial 
or final state interactions), and we have used that Hw and THwT~! are Hermitian. A 
direct test of whether THwT~t = Hw requires the comparison of two different reactions, 
which is very difficult (or impossible for decays). This argument is readily generalized to 
an arbitrary theory, using the transition matrix element defined by (B.1) on page 511. The 
interaction Hamiltonian in (B.2) is now understood to include all interactions, so that i and 
f are free particle states. If T commutes with the Hamiltonian, then the transition operator 
U = U(+00, —oo) satisfies TUTT! = Ut, so that 


(FIC) = (TFUŻ Tiy = (TilU |T f). (8.319) 


It is, however, possible to search for pseudo T violation by measuring such T odd quanti- 
ties as the triple correlations pi (p2 x p3 ) or 51- (p2 X p3 ), where 5; or pj are spins or momenta 
in the process.5? However, care must be taken with strong or electromagnetic final (or ini- 
tial) state interactions, which can mimic the effects of T violation or at least complicate its 
determination. For definiteness, we will consider the triple product T(P) = pj - (p> X P3). 
Correlations involving spin and more detailed derivations are described in (Gasiorowicz, 
1966; Sozzi, 2008). The expected value of 7 is 


“gy _ LDer@)|ks\Hwl)? 
OP) = FAG wie 


where Dy contains the appropriate phase space and flux factors. Any CP or T violating 
effect requires interference between contributions to the amplitude with different phases, 
analogous to (8.307). Since r(p’) is also odd under space reflection, one needs the interference 
of amplitudes with difference parities. Let us therefore assume that 


(8.320) 


(flHwli) = A@)e*4e%"4 + B(p jelte) eiee, (8.321) 


where we have suppressed the spin indices. 64 g are the strong phases associated, e.g., 
with final state interactions; %4, are the weak phases associated with the T and CP- 
violating parts of the interaction; and the real amplitudes A and B have opposite parities, 
i.e., A(—p’) = A(p’) and B(—p) = —B(p). The phase 7/2 could have been absorbed in dz, 
but is pulled out for convenience. Such a relative phase always occurs in the interference 
between even and odd parity amplitudes when summing over spins, as can be seen, for 
example, in (2.172) on page 36. It is clear that 


(T(P)) x cos(d4 — ôg) sin(ya — ys) +sin(da — dz) cos(ya — YB), (8.322) 


so a non-zero value can be induced by final state interactions (ĝa — 6g #4 0), by T and 
CP violation (p4 — yp # 0), or both. For this reason, it is difficult to isolate T violating 
correlations in processes involving more than one strongly interacting final or initial state 
particle. There is, however, some possibility of observing such effects in cascades in super- 
symmetric theories in which there are mainly leptons and neutralinos in the final state (e.g., 
Bartl et al., 2004; Langacker et al., 2007; Ellis et al., 2009). 

More promising are electric dipole moments (EDMs) d y for particle f (for recent reviews, 
see Ibrahim and Nath, 2008; Roberts et al., 2015; Bernabéu and Martinez-Vidal, 2015), 
defined by its interaction with an electric field E ; 


Hpepm = -dp È. (8.323) 


52Such correlations must be associated with Lorentz invariant quantities, such as Euvpo ph PX PoP; so the 
process must be sufficiently complicated to allow enough independent four-vectors. 
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For a spin-5 field this corresponds to an effective Langrangian density 
= ee 
LEDM = dha by Fv. (8.324) 


If we define the electromagnetic form factors F and gt of wp f” ŲYr analogous to those of 
the proton in (2.379) on page 77, then dy is related to gf by 


f f 
dy = {0 (0) z 4116920) z (8.325) 
2m mf 


where qre and my are the electric charge and mass of f (Problem 2.30). For quarks one can 
also define analogous chromoelectric dipole moments (CEDMs) dÏ by 


—i BEA xi 
LCEDM = zigo" PUG (8.326) 


It is obvious from (8.325) that an electric dipole moment (or chromoelectric dipole 
moment) violates time reversal and space reflection invariance, since by classical reasoning 
dy must be even (odd) under T (P), while § is just the opposite. This extends to any 
non-degenerate system using the Wigner-Eckart theorem, and will be shown formally for 
spin-5 in Appendix G. 

There are stringent experimental limits on possible EDMs for the neutron, paramagnetic 
atoms”? (e.g., ?°°T1), diamagnetic atoms (e.g., °° Hg), and molecules (e.g., Th O) 


Ida] < 3.0 x 107° (90%), |dizg| < 3.1 x 10779 (95%), Idril < 9 x 10~*° (90%) 
Idel] < 9.8 x 107” (90%), |dp| < 7.9 x 107” (95%), (8.327) 


all in units of e-cm. The limits on de and dp are derived from those on ThO and dyg, 
respectively, assuming no other sources of T violation [the limits are weakened somewhat 
if one also allows for T and P-violating eN or mN interactions (e.g., Chupp and Ramsey- 
Musolf, 2015)]. Future experiments may improve the sensitivities significantly. There are 
also prospects for new or greatly improved sensitivities to other EDMs, such as the muon 
or deuteron. 

The predicted EDM effects due to the CKM mixing in the standard model are extremely 
small due to approximate accidental symmetries. The quark EDMs are only generated at 
the three-loop level, and are expected to be of O(107°4 e — cm), while de first enters in four- 
loop diagrams (at least for massless neutrinos) and should be smaller than ~ 10738 e-cm. 
The contributions of the quark EDMs to dn are expected to be negligible, with the largest 
CKM part of d¢*” ~ 107%? e-cm from a two-loop diagram involving a gluon penguin. 
Other CKM contributions to atomic EDMs are also expected to be small. 

One major complication, however, is from the strong CP violation (@gcp) term in 
the QCD Lagrangian density in (5.2), which contributes d? ~ 5 x (10716 — 10715) dgcp 
(Crewther et al., 1979; Kim and Carosi, 2010). One therefore needs |@gcp| £ 10~'°—107"'" 
while CKM effects are expected to shift 6gcp by O(107%). It is not known whether 6gcp 
is small due to an accidental or other cancellation between the bare and shift values, or to 
some dynamical mechanism, and therefore whether or not d? = 0. This strong CP problem 
will be elaborated in Section 10.1. 


53The Schiff theorem (Schiff, 1963) states that atomic EDMs vanish in the non-relativistic limit for a 
pointlike nucleus because of electron screening effects. Fortunately, it is violated by relativistic and finite 
nuclear size effects. 
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Because the SM contributions are expected to be so small (except possibly for d?) EDMs 
are an excellent place to search for the effects of new physics (e.g., Pospelov and Ritz, 
2005; Fukuyama, 2012; Engel et al., 2013). These typically have new CP-violating phases 
and allow EDMs at one-loop level, leading to values already excluded or within reach of 
future EDM experiments. For example, in the MSSM the e~ and quark EDMs and CEDMs 
(which feed into EDMs) can be generated by one-loop vertex diagrams involving neutralinos, 
charginos, gluinos and the fermion scalar partners (e.g., Ibrahim and Nath, 2008; Ellis et al., 
2008), with new CP phases possible from the u and By terms, gaugino masses, and A terms. 
There may also be significant effects from induced three-gluon operators (Weinberg, 1989a), 
and from Higgs exchange and two-loop diagrams involving Higgs fields (Barr and Zee, 1990). 
For large CP phases the existing EDM limits typically require SUSY masses > O(1 TeV) 
unless there are fine-tuned cancellations. Conversely, superpartner masses < 1 TeV would 
require small CP phases, creating some tension with models of electroweak baryogenesis 
(see Section 10.1). At any rate, future experiments have an excellent chance of observing 
EDMs if TeV scale supersymmetry, or many other SM extensions, exists, and would be a 
powerful diagnostic. 


8.6.6 Flavor Changing Neutral Currents (FCNC) 


Flavor changing neutral currents are similar to electric dipole moments and CP violation 
in that they are strongly suppressed by approximate accidental symmetries in the SM, but 
can be much larger in most extensions. 

In the standard model the couplings of the Z to fermions are flavor diagonal at tree 
level because of the GIM mechanism (Glashow et al., 1970), i.e., because all fermions that 
have the same charge, color, and chirality and are therefore able to mix with each other are 
assigned to the same kind of SU (2) x U(1) representation. Off-diagonal Z-fermion couplings 
are induced at loop level, mainly by penguin diagrams such as in Figure 8.44, but these are 
small. The GIM mechanism also significantly suppresses the contributions of box diagrams 
such as in Figure 8.43. Similarly, the couplings of the Higgs to fermions are flavor diagonal 
in the SM because the same transformations that diagonalize the fermion mass matrices 
automatically diagonalize the H Yukawa couplings. This continues to hold in models with 
multiple Higgs doublets provided that only one doublet couples to q} u%, one to g} dh, and 
similarly for the leptons (Glashow and Weinberg, 1968). 

Many types of new physics lead to FCNC, often including new sources of CP viola- 
tion (see, e.g., Langacker, 1991; Artuso et al., 2008a; Altmannshofer et al., 2010; Isidori 
et al., 2010; Mihara et al., 2013; Buras and Girrbach, 2014). In many cases the new effects 
enter at tree level, where they are especially significant because they compete with SM 
loop effects. Other types enter at loop level, but may be enhanced compared with SM loop 
effects by stronger couplings or by not having the same cancellations. In some cases the new 
physics leads to processes that are forbidden or nearly so in the SM. For example, the SM 
exhibits an approximate lepton flavor symmetry: to the extent that the v masses can be 
ignored there are separately conserved Le, L, and L, numbers, with Le = +1 for (e~,ve), 
Le = 0 for (u~,v,,) and (7~,v;), and analogously for L,,,-. Processes such as js > ey are 
therefore essentially forbidden in the SM up to negligible (m,/Mwy)* effects (Cheng and Li, 
1977), but are allowed in many extensions. 

One type of new physics that leads to FCNC effects at tree level involves heavy Z’ 
gauge bosons with GIM-violating family non-universal couplings (Langacker and Plumacher, 
2000), as frequently occur in string constructions (e.g., Blumenhagen et al., 2005). Closely 
related are Kaluza-Klein excitations of neutral gauge bosons in models in which the fermion 
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families are located at different positions in large (O(TeV~‘)) or warped extra dimen- 
sions (Delgado et al., 2000), which therefore have family non-universal couplings. The mix- 
ing of ordinary fermions with heavy ones with exotic SM quantum numbers (e.g., left-chiral 
singlets or right-chiral doublets) can also lead to off-diagonal Z couplings (Langacker and 
London, 1988b). FCNC may also be mediated by heavy gauge boson exchange in alter- 
natives to the Higgs mechanism, such as extended technicolor (Eichten and Lane, 1980; 
Hill and Simmons, 2003; Appelquist et al., 2004), or in models with gauged family sym- 
metries (e.g., Cahn and Harari, 1980). Another possibility is multiple Higgs doublets that 
couple to the same types of fermion (Hall and Weinberg, 1993; Atwood et al., 1997; Branco 
et al., 2012). Leptoquarks are possible particles with couplings to quarks and leptons, such 
as (z a — DER)ILAS, while diquarks have couplings like d Rus de. In this example, the an- 
tilepton fields are expressed in a conjugate form analogous to the conjugate Higgs doublet 
in (8.14), and d isa heavy spin-0 color anti-triplet with electric charge +3. The tilde is 
suggestive that d€ could be the scalar partner of the s° (or the d° for the leptoquark case) 
in R-parity violating versions of supersymmetry (Hewett and Rizzo, 1989). The two types 
of coupling violate lepton number and baryon number, respectively, and rapid proton decay 
would result if both were present simultaneously. Even if one is absent (stabilizing the pro- 
ton) the other could mediate FCNC (Barbier et al., 2005; Dorner et al., 2016). We finally 
mention that some models involving composite quarks or leptons can lead to FCNC, e.g., 
by constituent interchange or configuration changes (Harari, 1984). 

Loop effects in new physics models may also generate significant effective FCNC inter- 
actions, by box diagrams analogous to Figure 8.43 or by penguin-like vertex corrections 
that lead to off-diagonal G, Z, y, or H vertices. These can be important in supersymmetry, 
especially in diagrams involving gluinos (which couple strongly) or in the limit of large tan 8 
(for which the b Yukawa is large), as discussed in Section 10.2.5. Similarly, there can be 
important loop-induced effects in SU (2)z x SU(2)r x U(1) models involving a heavy Wr 
coupling to V+ A (Section 10.3.2), or in extensions involving heavy neutrinos (Section 9.6). 

There are stringent constraints on new sources of FCNC and CP violation from K°— K°, 
D? — D°, and B° — B° mixing (Sections 8.6.3 and 8.6.4). There have also been ex- 
tensive searches for and observations of rare K decays (e.g., Barker and Kettell, 2000; 
Cirigliano et al., 2012). Many focus on lepton flavor-violating decays such as Ky, > u* eF 
or K* —+ a*p*eF, which are completely negligible in the SM. Other decays are strongly 
suppressed in the SM, and serve as probes of the CKM matrix, new physics, CPT violation, 
or long-distance strong interaction physics. These include Kz — 7°¢+¢~, which includes a 
significant direct CP violation component, and the decays K+ + mrtvo and Kr > r°vi, 
which are, respectively, CP-conserving and violating and should be excellent probes of new 
physics (Buras et al., 2015a). 

In addition to mixing and CP violation effects, rare decays of B mesons are especially 
promising for testing the SM and searching for new physics at the tree or loop level (Blake 
et al., 2015; Ali, 2016). This is in part because the charmless modes are strongly sup- 
pressed in the SM and in part because some types of new physics couple most strongly to 
heavier particles. For example, the inclusive radiative decays B —> X,7, where X, carries 
strangeness, has a branching ratio 3.43(22) x 1074, in agreement with the SM expectation 
3.36(23) x 10-4, whereas the rate could have been strongly enhanced by the existence of 
a charged Higgs boson in some parameter regions of two-doublet models. Similarly, CMS 
and LHCb have recently measured B(B° — u* u~) = 2.8(7) x 107° (Khachatryan et al., 
2015a), within about lo of the SM prediction 3.55(23) x 1079, excluding some parameter 
regions of supersymmetry. However, their constraint B(B° > wtp) = 3.9(1.5) x 1071° 
is higher than the expectation 1.01(9) x 1071? by about 2c. Although this is not by it- 
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self significant, there are a number of other hints of discrepancies in B decays at the 
(2 — 4)o level. These include deviations at the (2 — 3)o level (depending on the theo- 
retical modeling) in the angular analysis of the exclusive B + K*y*p~ decays observed 
by LHCb and Belle. There is also a possibility of lepton-flavor nonuniversality in the ratio 
(B= > K* utu )/T(B} > K*ete—) = 0.745(97) measured by LHCb, about 2.60 below 
the SM value ~ 1, suggesting, e.g., a heavy Z’. (The absolute rate for K*ptp is also 
~ 1.80 low.) Similarly, the charged current decays B > D“)rv as measured by BaBar, 
Belle, and LHCb appear to violate universality by ~ 3.90 when compared to those for 
B => D®) 4v, l =e, or u, suggesting a leptoquark contribution. None of these hints of new 
physics are compelling, but together they hint at interesting things to come. 

Similarly, there have been extensive searches for lepton flavor-violation, such as the 
leptonic processes u — 3e, u — ey, p — e conversion in interactions with a nucleus, 
and analogous 7 decays (Mihara et al., 2013; de Gouvéa and Vogel, 2013; Pich, 2014; 
Gorringe and Hertzog, 2015). For example, the MEG collaboration at PSI has recently 
obtained (Baldini et al., 2016) B(ut — ety) < 4.2 x 10713 at 90% c.l. There have also 
been stringent limits on lepton-flavor violation in B, D, and Z decays. Current limits on 
lepton-flavor violation in Higgs decay are weak (e.g., Khachatryan et al., 2015b; Aad et al., 
2015b), but should become important in the future. 

As will be described in Chapter 10 there are good reasons to suspect that new physics 
emerges at the TeV scale. As emphasized above, most types are likely to lead to observable 
FCNC, CP violation, and EDM effects. In fact, it is surprising that no such effects have 
yet been observed. The constraints on new physics relevant to FCNC can be parametrized 
by effective higher-dimensional operators, especially four-fermi operators (e.g., Buchmuller 
and Wyler, 1986; Grzadkowski et al., 2010) such as 


AS|= Cab ; 7 7 
> p2 (dasa) (douso) + h.c. (8.328) 
a,b=L,R 9% 


For example, Amg and e (defined in (8.270)) imply (Nir, 2015) 


Arr > 10° GeV [Amg], Arr 


y |cLL] [Sm ciL] 


assuming only that the new physics part is not larger than the experimental values. There 
are even stronger limits on the LR operators. The analogous limits from B and D mixing are 
in the 105 — 10° GeV range. The implication is that new physics at the A ~ TeV scale should 
have been seen by now unless the coefficients like Cap are very small. This could be due to 
very weak coupling, but the motivations for new TeV physics such as the Higgs/hierarchy 
problem or gauge unification suggest that it should have at least electroweak scale coupling. 
One therefore presumably requires that the coefficients of the flavor changing operators are 
strongly suppressed, similar to those from the SM (e.g., the factor of m2|Vea|?|Ves|?/My, in 
(8.237)). Since most new physics models do not have such strong suppressions, there is a 
strong tension between attempts to solve the Higgs/hierarchy problem and constraints from 
FCNC, CP, and EDMs. This tension has led to much recent discussion of minimal flavor 
violation (MFV), which is the hypothesis that all flavor violation, even that which is associ- 
ated with new physics, is proportional to the standard model Yukawa matrices (D’ Ambrosio 
et al., 2002; Nir, 2015), leading to a significant suppression of flavor changing effects simi- 
lar to the SM. Examples include supersymmetry with anomaly or (simple forms of) gauge 
mediation of supersymmetry breaking (e.g., Chung et al., 2005). 

We have already emphasized that for massless neutrinos the SM possesses an accidental 
lepton flavor symmetry. An extension of this idea, which motivates minimal flavor violation, 


> 2x 10’ GeV [e], (8.329) 
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is that there are no flavor changing or CP violation effects at all in the SM when one turns 
off the Yukawa interactions (9g¢p becomes unobservable as well). In fact, in that limit 
there is a global U (3)? flavor symmetry associated with the three families each of qz, £L, 
ur, dr, and er (U(3)® if we include the vg), with the diagonal generators corresponding 
to, e.g., conserved ur, cr, and tr numbers. Of course, this symmetry is strongly broken by 
the third family Yukawas, but those of the first two families are small. 

It is still uncertain whether MFV is employed by nature, or whether FCNC are sup- 
pressed by some other mechanism. It is clear, however, that FCNC, EDMs, CP violation, 
and rare decays have an enormous reach in searching for new physics. Effects involving 
the ¢ quark and other third family members are especially interesting (Section 7.2.7), since 
it is so much heavier than the other two and may play a role in new physics, especially 
new physics associated with spontaneous symmetry breaking. Experiments at the LHC and 
lower-energy facilities, and at possible future colliders, will study and search for t quark 
decays; rare u, T, K, D, and B decays and processes; rare and flavor-changing Z and H 
decays; and EDMs. Lepton number and lepton flavor violation associated with neutrinos 
will be further discussed in Chapter 9, and baryon number violation, e.g., baryogenesis, 
proton decay, and neutron oscillations, in Chapter 10. 


8.7 PROBLEMS 


8.1 Consider a generalization of the SU(2) x U(1) model involving k multiplets ¢;,i = 
1---k, of complex scalars. The dimension of the i” multiplet is 2t; + 1, where t; can be 
0,1/2,1,3/2---, and the elements have T? eigenvalues t? = —t;, —t; +1---t; (cf., the rota- 
tion group). Also, the ¿i multiplet has weak hypercharge y;. Assume that each multiplet 
has one electrically neutral component ¢?, i.e., with q; = t? + y; = 0, and that that compo- 
nent acquires a vacuum expectation value (¢°) = v; / v2. 

(a) Show that the mass eigenstates W+, Z, and A are the same as in the standard model. 
(b) Calculate the W and Z masses in terms of g, g’, ti, t?, and vi. 

(c) The po parameter, po = Mj, /(M? cos? Oy) is predicted to be unity at the tree level in 
the standard model and in extensions involving additional Higgs doublets. Show that in the 
more general case 


k 
hie My _ Doverlti(ti +1) - (lr? 
Ma coe Oy 2 Din (Pll 


(d) Specialize to the case of one doublet and two triplets 


ot t+ Z+ 
o) a ( o? ) , ® = or ’ X = = $ 
p° D- 


where vg = V2(¢°) > ve = V2(®°) and vg > vs = V2(X?}). Calculate po to leading 
nontrivial order in ve/vg and vy /V¢. 

(e) Now consider the case of multiple Higgs doublets but no higher-dimensional representa- 
tions. Argue that the couplings of neutral physical Higgs bosons to fermions will no longer 
be flavor-diagonal. (Do not attempt to write the Higgs potential or find the exact Higgs 
mass eigenstates. ) 


8.2 Verify that there are no Y or Y? anomalies in the SM. 
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8.3 In the SM with a single Higgs doublet ¢ one can always perform an SU(2) x U(1) 
transformation so that (¢) = -& 3 J with v real. Therefore, SU (2) x U(1) > U(1)Q 


and there is a conserved electric charge, Q. However, for two or more doublets with the 
same hypercharge yn = +4 one has 


Vn { exp(ipn) sin an 

V2 \ exp(ion) COs An 
where a1, 01, and pı can be chosen to be zero by an SU(2) x U(1) transformation, but the 
other angles are determined by the potential. If any of the a, are non-zero then SU(2) x U(1) 
is completely broken and there is no conserved electric charge. Nontrivial values of on and pn 
may be associated with CP violation, although in some cases they can be rotated away by 
field redefinitions. Analyze the two doublet case and show under what conditions it leads to 
the spontaneous breaking of U(1)g. To simplify the analysis: consider only renormalizable 
terms, impose CP invariance (i.e., assume that the coefficients in the potential are real), 
impose the Z2 symmetry ¢; —> —¢ with ¢2  +¢2, ignore terms involving the “tilde fields” 
analogous to (8.14), and assume that the parameters are such that vı 4 0, v2 4 0. Hint: it 
is not necessary to actually determine the values of the vi. 


8.4 Suppose the scalar sector of the standard model is extended, so that it includes not 
only the ordinary Higgs doublet ¢, but also a new complex field ø. Assume o transforms as 
a singlet under the SU(2) gauge group (and also under SU(3) of color), but that it carries 
weak hypercharge yz = 1 and therefore electric charge qo = 1. 

(a) What is the most general renormalizable gauge invariant potential V (¢, 0)? 

(b) Show that there are some choices of parameters for which not only ¢ but also ø will 
have non-zero vacuum expectation values. 

(c) Suppose that the parameters are such that in unitary gauge, 


75-A? a ee ee )’ 


where vo = V2(c) and v = V/2($°) are the (real) vacuum expectation values, and © and 
H are physical real scalars of definite mass. Assume that vy, < v. Show that the photon 
acquires a small mass, and calculate it to leading nontrivial order in v,/v. 

(d) Show that Z can decay into y} if it is kinematically allowed, and calculate the rate to 
leading nonzero order in v,/v. 


8.5 Add a non-chiral lepton doublet Zz, R with y = —4, to the SU(2) x U(1) theory, i.e., 


(E), (E)n 


(a) Write the Lagrangian density (before SSB) for £ = Z; + £r, including all allowed kinetic 
energy, gauge interaction, mass, and Yukawa terms. 

(b) Now turn on SSB. Display the couplings of £ to WF, A, and Z and to the Higgs scalar 
H. 


8.6 The mass term for three chiral fermions Y? p = (Y?r r YIL R YSL R)” iS 


1 a 0.5 
-L= PL MYS + h.c., where M = 0 3+ i 2 
0.25 7 6 
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Find the physical mass eigenvalues m; and unitary matrices Az rR for which AM AR = 
diag(mı mz M3) (analogous to (8.47)). Make sure that the m; are real and nonnegative. 
Hint: use any convenient numerical package. 


8.7 Let y? and ws be two fermion fields, and define 


0 
wa (Up ) sub +ue 


Y3 
Let 7 E 
L = Pip’ — (icp TY? + h.c.) , 
where c is real and positive and I = ; ; . Calculate the physical fermion masses, and 


express the mass eigenstate fields Y%z,r in terms of Y? p. 


8.8 Suppose one adds to the standard model an exotic non-chiral pair of charged leptons, 
Ezp which are both SU (2) singlets with yg = —1. Then, ignoring the second and third 
families, the leptons are 


v? o) 0 0 
( —0 ) ER Ey ER - 


The mass matrix for the charged leptons is (dropping the superscript —) 


0 
0 ZF x y ER 
-c= (& 82) (5 BY (ga ) the 


where x and y are generated by the VEVs of the Higgs doublet, and A and B are bare 
masses (or can be generated by the VEVs of an SU(2) singlet Higgs). e?, and E9, have the 
same quantum numbers, so w.l.o.g. we can take linear combinations such that A = 0. We 
also assume that the parameters are real with B > x,y. 

(a) Find the mass eigenvalues and eigenstates to order «/B and y/B. 

(b) Find the weak neutral current J4 to the lowest nontrivial order in x/B and y/B for 
each term, and show that it includes flavor changing components. 


8.9 A hypothetical future accelerator allows the collisions of e~ with Higgs particles. 

(a) Draw the tree-level diagrams for e~ H — e~ Z, and show that only one of them is nonzero 
for Mme = 0. 

(b) Calculate the spin-averaged center of mass differential cross section for e` H — e~ Z as 
a function of the CM scattering angle 0, s = Eig, Mz, My, g, and 0w. Take me = 0. 


8.10 Suppose the SU(2) x U(1) model is extended by an additional global or gauge U(1) 
symmetry that forbids an elementary down Yukawa, i.e., T? in (8.13) must vanish. Introduce 
a vector pair of SU(2) singlets Dz,r and a complex scalar ø with charges that allow 


-L= TPJLODR + MpD,Dr + koDrdr + h.c. 


(a) Display possible U (1) charges consistent with £. 
(b) For large Mp one can integrate out the D. Show that this leads to a higher-dimension 
operator that generates an effective Yukawa coupling for dr when (o) # 0. 
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8.11 Use the Ciu, a part of the parity-violating e-hadron interaction in (8.118) to calcu- 
late the corresponding non-relativistic potential in (8.122). Use the formalism developed in 
Problem 2.33. 


8.12 Verify the expression in (8.173) for the forward-backward asymmetry in e~e+ > ff 
at the Z-pole directly using trace techniques. Neglect the fermion masses. Hint: it is not 
required to calculate of and og separately, only the combination App. 


8.13 (a) Find the spin and color-averaged da/dz for q-G- > p“ u* and Ẹrqr —> pt, due 

to s-channel y and Z exchange, where z is the cosine of the scattering angle between qr and 
ye. This can be obtained by appropriately modifying the results in (8.125)—(8.131). 
(b) Now consider pp > u` pu™ at the LHC, where y is the rapidity of the y` u” pair and 
V8 and 2 are respectively the total energy and cos@ in the ppt CM. Since both gg, 
and q;q, can contribute it is conventional to define 6 with respect to the direction of the 
rapidity, i.e., along pa(ps) for y > 0(< 0). Find da/dydz near the Z-pole in the narrow 
width approximation in terms of the luminosity functions, analogous to (6.28) and (6.29). 
(c) Find expressions for the forward-backward asymmetry 


1 35 do 0 4, do 

_ Jo dz dydé Ja dz dyd 
~ plo. de 0 Jy da’ 
Jo dê gyd ae A dz Fide 


and for Arp(y1, y2), in which the numerator and denorminator are each integrated over y 
from 0 < yı < ye < In(\/s/Mz). Note that these are the same as the charge asymmetries 


Ars(y) 


) = 2 (lyn < lY+ 1) 
)+ F(1Yp-1 < lyp) 


F([Yp-| > Yu 
alyp- | > [Yat 


Ac = 


(d) Plot Arg(y) and calculate Ar g(0,ln(ys/Mz)) at the LHC for ys = 14 TeV. 


8.14 (a) Write the tree-level amplitudes for e~ (pı)e* (p2) > W- (p3)W* (pa) correspond- 
ing to the three diagrams in Figure 8.29. Neglect me and my., but not My or Mz. 

(b) Use any convenient computer algebra program to calculate and plot the spin-averaged 
cross sections in pb as a function of ys from 2My to 200 GeV, including just the ve, the ve 
and y, and all three diagrams, and show that they roughly agree with Figure 8.30 (which 
includes higher-order corrections). 


8.15 From (8.55), the interaction Lagrangian for the coupling of the top quark t to the 
bottom quark b and the Wt is 


L= —gW,, by" (1 — 9 )t + h.c., 


where g = gV;,5/2V2. 

(a) Calculate the differential decay rate dr /dcos0 for t + bW in the t rest frame, where 
6 is the angle between the t spin direction and the b momentum. Sum over the b and WT 
spins. Neglect ma but keep My. 

(b) Calculate the numerical value (in s~') for the total t decay rate into bW*, ignoring mp. 
(c) Calculate the fractions Fo, F}, and F_ of all t decays that are into a W with helicity 0 
(longitudinal), +1 (R) , or —1 (L), respectively. 

(d) One of the Fo, + should vanish. Interpret that fact in terms of angular momentum. 


The Standard Electroweak Theory 371 


8.16 Show that the effective interaction in (8.192) reproduces the SM result in (8.190) for 
the couplings in (8.193). 


8.17 Integrate the one-loop RGE in (5.33) and (8.199) numerically (using Mathematica, 
Maple, Fortran, C++, etc) and verify qualitatively the upper and lower limits on My in 
Figure 8.32. 


8.18 The renormalization group equations sometimes have infrared stable or ultraviolet 
stable fixed points, which are constant values for couplings or their ratios that are ap- 
proached asymptotically for Q? — 0 or Q? —> oo. As a simple example, show that the ratio 
hi(Q?)/gs(Q”) has an infrared stable fixed point in the standard model if one neglects g 
and g’, and find its value. The relevant RGE equations are given in (5.33) and (8.199). (The 
exact solution can be found in (Pendleton and Ross, 1981).) In practice the fixed point may 
not be reached until Q? is too small for the one-loop equations to be valid. 


8.19 Derive the decay rate for H — GG in the standard model from the effective La- 


grangian 
Qs 


12rv 


Left = HG G" 


and show that it agrees with (8.212). 


8.20 The discovery of the Higgs boson at the LHC involved pp scattering at 7 and 8 TeV, 
with ATLAS and CMS each accumulating ~ 5 fb~! of luminosity at each energy. To model 
this, consider the production and decay of a 125 GeV Higgs at 8 TeV for L = 10 fb™t. 

(a) Calculate the cross sections at LO for the dominant production models (GG > H, 
WtW- —> H, ZZ > H, associated W+H, and assocated ZH production). Compare these 
with the full (NNLO (QCD)+ NLO (EW)) results in Figure 8.35. Use the narrow width 
approximation for gluon-gluon fusion (GGF). For vector boson fusion (VBF) use the equiv- 
alent W approximation, which can be inferred from the Cross-Section article in (Patrignani, 
2016) (although it tends to overestimate). The Higgstrahlung cross sections can be directly 
calculated or can be taken from the same article. 

(b) Estimate the total number of signal events for the discovery channels H —> yy and 
H > ZZ* — Af, i.e., 4e, 4u, or 2e + 2u. 

Hint: Use the theoretical SM values B(H — yy) ~ 0.0023, B(H > ZZ*) ~ 0.026, 
B(H + GG) ~ 0.086, Ty ~ 4.1 MeV, and I'(Z > ete) ~ T(Z => utu) ~ 0.034. 


8.21 Suppose that the standard model were extended by a fourth chiral family (¢’, b’, €’, 
v’) with masses (including that of the v’) >> my. Estimate how the rates for pp > H > ZZ* 
and pp > H — yy would be changed relative to the SM, assuming production by gluon 
fusion. Hint: assume that the widths for GG and yy can be scaled from (8.212) and (8.213). 


8.22 Modifications of the Higgs interactions due to new physics can often be described by 
higher-dimensional operators (HDOs). As a simple model, consider a Hermitian scalar field 
ġo with 


1 À 
Lo = 5 (8.6)? + ZOPP -VO V(b) = Fe? + Sot + Eo, 


with u? < 0 and \ > 0. The renormalizable part of Lẹ models the Higgs component 3 


in (8.18) in unitary gauge. The coefficients o and p of the NROs have dimensions mass~?. 
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They are assumed to be nonnegative and small. We will see that they not only induce 
new interaction vertices, but also modify the strength of the 3- and 4-point vertices when 
expressed in terms of the observable VEV and scalar mass. 

(a) First take o = p = 0. Show that v? = —y?/\ and MÈ, = 2\v?, where v = (¢) and the 
“Higgs scalar” is H = ¢—v. Show that the coefficients of H? and H* in V are respectively 
M?,/2v and M?,/8v*. Hint: substitute u? = —Av? in the expression for V(H + v). 

(b) Now take o = 0, p £0. Show that v? = (~A + y A2 — 4p u2) /2p. Calculate M7, and the 
coefficients of H”, n = 3,4,5,6, in terms of Mz, v, and p. It is again useful to substitute 
the expression for u? in V(H +v). 

(c) For o £0, p = 0, one should perform a field redefinition ¢ = KH + v, where v = (¢) = 
—?/X and « is chosen so that the kinetic energy term for H is canonical, i.e., $(0,H)?. 
Determine, K, Mj,, and the interactions in terms of My, v, and ø to linear order in ø. Hint: 
there are terms x H?(0,H)? and H(0,H)?, as well as H? and H*. 


8.23 Calculate the amplitude and cross section for ZŁ ZŁ — wr W,, at high energy using 
the equivalence theorem. 


8.24 Use the equivalence theorem to rederive the leading term in m;/Mvy for the polarized 
differential decay rate for t > bW* considered in Problem 8.15a. Neglect mp. 


8.25 Derive (8.210) on page 331, and use the equivalence theorem to verify the leading 
term for large My /My. 


8.26 Choose the phase conventions for the K? and K? fields according to the SU(3) 
convention in (3.91), and express the pseudoscalar octet fields in terms of the quark fields 
by @ = —igy7°4, where q = (ud s)" (as in (5.102)). Calculate the CP phase ng in (8.231) 
in terms of the C'P phases for the d and s quarks defined in Section 2.10. Show that ng = —1 
under the additional convention that the d and s have the same CP phases. 


8.27 Derive (8.234), (8.235), and the statement that the Iz, = 0,2 states must have 
nonzero internal angular momenta. 


8.28 Derive (8.244) using the Fierz identities in (2.215) on page 43. 


CHAPTER 9 


Neutrino Mass and Mixing 
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Neutrinos are a unique probe of many aspects of physics, geophysics, and astrophysics on 
scales ranging from 10733 to 10+?8 cm. Neutrino scattering and decays involving neutrinos 
have been essential in establishing the Fermi theory and parity violation, determining the 
elements of the CKM matrix, and testing the weak neutral current predictions of the stan- 
dard model, and therefore played a significant role in the precision electroweak program as 
described in Section 8.3.6. Deep inelastic scattering involving neutrinos and charged leptons 
has also been critical in establishing the existence and properties of quarks, the structure 
of the nucleon, and the predictions of the short distance behavior of QCD. Similarly, neu- 
trinos are important for the physics and/or probes of the Sun, Earth, stars, core-collapse 
supernovae, the origins of cosmic rays, the large scale structure of the universe, big bang 
nucleosynthesis, and possibly baryogenesis. 

Neutrinos are also interesting because their masses are so tiny and because, unlike the 
quarks, some of the leptonic mixing angles are large. Small neutrino masses are sensitive to 
new physics at scales ranging from a TeV up to the Planck scale, but because of their unusual 
nature there is a good chance that they are somehow connected with the latter, possibly 
shedding light on an underlying grand unification or superstring theory. The neutrinos are 
also unique in that they do not carry either color or electric charge. It is therefore possible 
(and many physicists think probable) that their masses are Majorana (lepton number vio- 
lating) rather than Dirac (lepton number conserving, analogous to the quark and charged 
lepton masses). Establishing the nature of the neutrino masses, as well as understanding 
the origin of the small masses and large mixings, is of fundamental interest. 

The original version of the SU(2) x U(1) model did not have any mechanism to generate 
nonzero masses at the renormalizable level. However, it is straightforward to extend the 
original model by the addition of SU(2)-singlet right-chiral neutrinos,’ allowing Dirac mass 
terms. These could yield light Dirac neutrinos if the Yukawa couplings are extremely small, 
as could occur, for example, if the Yukawa couplings are forbidden at tree-level by some new 
symmetry. Alternatively, SU(2)-singlet neutrinos could lead to light Majorana neutrinos 
through the seesaw mechanism. One could also generate small Majorana masses without 
right-handed neutrinos via extended Higgs sectors or higher-dimension operators. Most 
extensions of the standard model (with the notable exception of the minimal SU(5) grand 
unified theory) involve either SU(2)-singlet neutrinos or extended Higgs sectors, though 
they do not necessarily explain the smallness of the masses. 

In this chapter we review the basic issues related to the neutrino masses and mixings, 
the major classes of models, and some of the experiments. More detailed discussions may 


1 Also referred to as singlet, right-handed, or sterile neutrinos. 
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be found in a number of books (Bahcall, 1989; Langacker, 2000; Mohapatra and Pal, 2004; 
Giunti and Kim, 2007; Bilenky, 2010; Xing and Zhou, 2011; Barger et al., 2012; Zuber, 
2012; Valle and Romao, 2015; Suekane, 2015) and review articles (Raffelt, 1999; Dolgov, 
2002; Strumia and Vissani, 2006; Mohapatra et al., 2007; Gonzalez-Garcia and Maltoni, 
2008; Camilleri et al., 2008; de Gouvéa et al., 2013; Patrignani, 2016). 


9.1 BASIC CONCEPTS FOR NEUTRINO MASS 


9.1.1 Active and Sterile Neutrinos 


We saw in Chapter 2 that the minimal fermionic degree of freedom is a Weyl two-component 
field, as defined in (2.200) on page 41 and in Section 2.11. A Wey] field can be represented 
Vr 


either in four-component notation, e.g., by Yr = Pry = ( 0 


h or in two-component 


notation as Yz. We will use both in this section. 

It is useful to first distinguish between active and sterile neutrinos. Active (a.k.a. ordi- 
nary or doublet) neutrinos are left-chiral Weyl neutrinos that transform as SU(2) doublets 
with a charged lepton partner, and which therefore have normal weak interactions. The 
electron-type L doublet and its right-handed partner are 


a (hes Bal % 9.1 
ne Nie cp R= eNi (9.1) 
in four-component notation, where Yh = CHE is the field related by CP to Wr up to y 
matrices and a possible CP phase, as in (2.306) on page 57 (or (2.333) in two-component 
notation). We have also carried out a “tilde” transformation on the SU(2) doublet indices, 
analogous to the one for the Higgs in (8.14) on page 260, so that Č transforms as a 2 rather 
than a 2*. We reemphasize that we define jy = (Yr)? 7° = (Pr)! 7°, i.e., the Dirac adjoint 
acts on wz and not on wv. 

Sterile (a.k.a. singlet or “right-handed” ) neutrinos, which are present in most extensions 
of the SM, are SU(2) singlets. They do not interact except by mixing, Yukawa interactions, 
or beyond the SM (BSM) interactions. In four-component notation, a sterile right-chiral 
Wey] field will be written as vp and its conjugate as Vf, 


VR T Vi (9.2) 


In two-component notation, the L and R chiral fields will be written as Mz and NR, re- 
spectively, with their CP conjugates M and N€: 


Ni ee Np (active), NR Po NẸ (sterile). (9.3) 


9.1.2 Dirac and Majorana Masses 
Dirac Masses 


A fermion mass term converts a Weyl field of one chirality into one of the opposite chirality, 
-L = m (ardor + VorVaL) =m (vitr +Ul, War) f (9.4) 


as in (2.347) on page 63. (We have taken m to be real.) A physical interpretation is that a 
massless fermion has the same helicity (chirality) in all frames of reference, while that of a 
massive particle depends on the reference frame and therefore can be flipped. 
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A Dirac mass connects two distinct Weyl fields, i.e., Upp 4 Yp. For a single type of 
neutrino, a Dirac mass connects an active neutrino with a sterile one,” 


—Lp =™Mp (LUR + Drv) = MpVpYp 


= mp (NEN +NEN2), we) 
where vp = vz +p is a Dirac field. It has four distinct components, vz, Vp, vp and vz, and 
there is a conserved fermion number (or lepton number L in this case), corresponding to the 
global phase symmetry vL, R —> eB vL r. This L conservation ensures that there is no mixing 
between vz and vf, or between vpr and vh. However, when embedded in the SM context the 
neutrinos are chiral, so mp violates the third component T? of weak isospin by At}? = 3. 
It can be generated by the Higgs mechanism, as described in Section 8.2.1 and illustrated 
in Figure 9.1, and is in principle analogous to the quark and charged lepton masses. Dirac 
masses can be easily generalized to three or more families. However, the tiny values of the 
neutrino masses require that the Higgs Yukawa couplings h, = m,/v defined in (8.46) on 
page 267 would have to be extremely small if they are due to a simple Dirac-type Higgs 
coupling: m, ~ 0.1 eV would correspond to h, ~ 10712, for example, to be compared with 
the t quark coupling hy = O(1) or the electron coupling he ~ 1075. Of course, we do not 
understand the ratio h./h; either, so some caution should be taken with such statements. 
In any case, most particle physicists believe that either an alternative mechanism or some 
explanation for a small h, is needed, as will be discussed below. 


h 
VL = j 
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Figure 9.1 Mechanisms for generating a Dirac neutrino mass. Left: an elementary 
Yukawa coupling to the neutral Higgs doublet field ¢°. Right: a higher-dimensional 
operator leading to a suppressed Yukawa coupling. 


Majorana Masses 


Majorana mass terms are more economical in that they only require a single Weyl field, 
i.e., Vpr = Y£ p in (9.4). They are not as familiar as Dirac mass terms because they violate 
fermion number by two units. For the quarks and charged leptons such mass terms are 
forbidden because they would violate color and/or electric charge. However, the neutrinos 
do not carry any unbroken gauge quantum numbers, so Majorana masses are a possibility. 

For an active neutrino, a Majorana mass term describes a transition between a left- 
handed neutrino and its conjugate right-handed antineutrino. In four-component language, 
it can be written 


mMm 
—Lr = > (DLV + DRVL) = > (PCDF + vi Cvr) = > Mee. (9.6) 


?There are variant forms of Dirac neutrino masses involving two distinct active or two distinct sterile 
neutrinos, as will be discussed below. 
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As is clear from the second form, £r can be viewed as the annihilation or creation of two 
neutrinos, and therefore violates lepton number by two units, AL = 2. In the last form in 
(9.6), vm = vr + V% is a self-conjugate? two-component (Majorana) field satisfying* vm = 
vS, = Cvi,. A Majorana v is therefore its own antiparticle and can mediate neutrinoless 
double beta decay (Go), in which two neutrons turn into two protons and two electrons, 
violating lepton number by two units, as shown in Figure 9.2. A Majorana mass for an 
active neutrino also violates weak isospin by one unit, At? = 1 (hence the subscript T 
for triplet), and can be generated either by the VEV of a Higgs triplet or by a higher- 
dimensional operator involving two Higgs doublets (such as the minimal seesaw model), as 
in Figure 9.3. The 4 in Lr is needed to yield the correct expression for the Hamiltonian. It 
is somewhat analogous to the extra 4 in the free-field Lagrangian density for a Hermitian 


2 
scalar. This is more obvious from the kinetic energy term 


1. 1.2 one 
LKE = zŪmi vm) = 5 (PLi Øvr + VRi PVR) = PLi UL, (9.7) 


where the two terms are equal because of (2.302) on page 56. The Majorana mass term can 
be rewritten in two-component language using (2.331) as 


-Lr = TE (MNG + NGNL) = TE (Mio? NE — NF io?N,) . (9.8) 
P .P 


Figure9.2_ Diagram for neutrinoless double beta decay ((3o,). For a single Majorana 
neutrino, mgg is just mr in (9.6). 


A sterile neutrino can also have a Majorana mass term of the form, 


m m m 
-Ls = -> (Dive + Prvi) = > (DLCDI + vi Cvi) =P PmsYms» (9.9) 
where Vms = V} + Vr = Vus. In this case, weak isospin is conserved, At? = 0, so mg can 
be generated by the VEV of a Higgs singlet.” In two-component form 
m m 
-Ls = 78 (Nir + NİNE) = "5 (Nitio®N Et -NET io?NE) . (9.10) 


3Unlike a Hermitian scalar, a Majorana state still has two helicities, corresponding to vr, and vp. They 
only mix by the Majorana mass term, so there is still an approximately conserved lepton number to the 
extent that mr is small. For example, there could be a cosmological asymmetry between vz and vh, even 
for Majorana masses, if the rate for transitions between them is sufficiently slow compared to the age of the 
universe (Barger et al., 2003a). 

4We have taken a convention in which 7, = 1 for the CP phase in (2.306). More generally, vm = 
VL +N YR =n Cola where vR is still defined as cor. 

5mg could also be generated in principle by a bare mass, but this is usually forbidden by additional 
symmetries in extensions of the SM. 
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Figure 9.3 Mechanisms for a Majorana mass term. Top left: coupling to a neutral 
Higgs triplet field ¢%. Top right: a higher-dimensional operator coupling to two 
Higgs doublets. Botton left: the minimal seesaw mechanism (a specific implementa- 
tion of the higher-dimensional operator), in which a light active neutrino mixes with 
a very heavy sterile Majorana neutrino. Bottom right: a loop diagram involving a 
charged scalar field A7. 


The four-component fields for Dirac and for active and sterile Majorana neutrinos can 
therefore be written 


NES Re 
vD = NR ~ io? N 


(3) (uag) om = (36 = ug) 
MEN JAT N J OMS \ Ne) UNS j’ 


where two of the components are not independent in the Majorana cases. 
The free-field equations of motion with a Majorana mass term obtained from the Euler- 
Lagrange equation (2.19) on page 9 are 


(9.11) 


i vr —mrCr} = 0, ið Nr — mrio’ N$ =0. (9.12) 


This leads to the free-field expression 


37 . 
vula) = | gam È MBS) alps)? + 0H, s)at (pse, (913) 
P s=+,— 


which is of the same form as the free Dirac field in (2.159) on page 35 except that there is 
no distinction between a and b operators. Any spin basis can be used, but the helicity basis 
is usually most convenient. 

A more compact notation for dealing with Majorana masses and fields will be developed 
in Section 10.2.2. 
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A Comment on Phases 


We implicitly assumed that the masses mp and mr in (9.5) and (9.6) are real and positive. 
More generally, however, they can be negative or complex, but can be made real and positive 
by field redefinitions. In the Dirac case for an arbitrary w, one has generally 


Lp = mpýrýr + mpýrýL = mpY} r + mph Yz. (9.14) 


There is freedom to redefine both Yr, and wr by separate phase transformations to remove 
any phase in mp and make it positive (see Section 3.3.5 and Problem 3.32). In the SM, 
only the WCC interactions depend on the phases of the left-chiral fermion fields, and no SM 
interaction involves the right chiral phases. It is therefore convenient to choose the phases 
of the L-chiral mass eigenstates (pz in the simple example in (9.14)) by any convenient 
convention, and then adjust the wp fields to make the mass eigenvalues real and positive. 
This was done in Section 8.2.2 to remove unobservable phases from the CKM matrix, and 
a similar procedure can be applied to the lepton mixing if there are only Dirac masses. 
However, for a general Majorana mass term, 


—LM = ; (mmp? + muYICyL) = ; (mur V} io? — mý YTio?’Ys) : (9.15) 
there is only one independent field Yz (or Yz). One usually chooses to use the phase freedom 
in wy to make my, real and positive, in which case there is no remaining freedom. This will 
imply the existence of additional Majorana phases in the leptonic mixing matrix Ve for the 
WCC for the case of Majorana neutrino masses. However, such phases are only observable 
in processes such as Boy that involve the phases of the neutrino masses explicitly, as can 
be seen by working in an alternative convention of a simple Vz; but leaving mwm complex. 

Even after a phase redefinition we will define the conjugate fields by the same convention 
as for the original ones. That is, for 


v, =e uy, r = PNG, (9.16) 
we define 

vg =C", Ne sia NP. (9.17) 
Mixed Models 


When active and sterile neutrinos are both present, there can be Dirac and Majorana mass 
terms simultaneously. For one family, the Lagrangian density has the form 


-c=1( of o) ( n ) ( a ) pia (9.18) 
where 0 refers to weak eigenstates, and the masses are 
mr: |AL|=2, A =1 (Majorana) 
mp: |AL|=0, At = ; (Dirac) (9.19) 
ms: |AL|=2, A = 0 (Majorana). 


The two terms involving mp are equal since 


Vir Wer = VoLWar, VoRVe, = VorVaL 


é ee a - (9.20) 
PaL Vor = VoL Var; PaL DoR = Vor YaR 
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for arbitrary Pa, by (2.296) on page 56. Diagonalizing the matrix in (9.18) yields two 
Majorana mass eigenvalues m; and two Majorana mass eigenstates,’ ViM = viL Vir = ViM» 
with i = 1,2. The weak and mass bases are related by the unitary transformations 


0 c Oc 

VIL vif YE VIR \ _ art| YR 
=A ; 5 =A ; 9.21 
ea a(k) €2 (8) 9.21) 


similar to (8.47) on page 269. Az and Ap are generally different for Dirac mass matrices, 
which need not be Hermitian. However, the general 2 x 2 neutrino mass matrix in (9.18) 
is symmetric because of (9.20). This implies that AY = A} K, where K is a diagonal 
matrix of phases analogous to (8.52). (There is additional freedom in K in the presence of 
degeneracies.) K is in general arbitrary, but our phase convention, in which vf, = CH, 
implies K = I. 

There are several important special cases of the mixed model in (9.18): 


(a) Majorana: mp = 0 is the pure Majorana case: the mass matrix is diagonal, with 
mı = MrT, M2 = Mg, and 


20 ec _ ,,0c 
ML =UV s VIR Z YR (9.22) 
— ,,0c ec — pü ` 
VL ZVL, VəR Z VR. 


(b) Dirac: The Dirac limit is mr = ms = 0. There are formally two Majorana mass 


eigenstates, with eigenvalues mı = mp and m2; = —mp and eigenstates 
1 
nr =- Et), vir = aU TVR) 
V2 V2 
1 1 (9.23) 
0 Oc e 0c 0 
Yop, = ——(v; — vr’), Usp = — (vk — Vp). 
2 7a! LVL) 2R Ja! R R) 
Note that vı 2 are degenerate in the sense that |m | = [m2], but the actual eigenvalues 


have opposite sign. To recover the Dirac limit, we will depart from our usual procedure 
of redefining the phase of v2 to make mz positive. Rather, let us expand the mass term 
mp 


-L= > (LviR — PLVSp) + h.c. = mp (vey + ev), (9.24) 


which clearly conserves lepton number (i.e., there is no v? — v?° or v% — v?, mixing). 


Thus, a Dirac neutrino can be thought of as two Majorana neutrinos, with maximal 
(45°) mixing and with equal and opposite masses. This interpretation is useful in 
considering the Dirac limit of general models. 


(c) Seesaw: The limit ms > mpr (eg, mr = 0, mp = O(Mmu, Me, Ma), and 
mg = O(Mx), where Mx ~ 10!4 GeV) is known as the seesaw (or minimal see- 
saw) (Minkowski, 1977; Gell-Mann et al., 1979; Yanagida, 1979; Schechter and Valle, 


6For Majorana masses, and especially in mixed models, there is no conserved lepton number, and it is 
just a matter of definition to refer to the L states as particles, viz, and the R states as antiparticles, vf,. 
Unfortunately, this becomes awkward in the Dirac limit where the Weyl states vp and vf are labeled as 
particle and antiparticle because they carry lepton number +1 and —1, respectively. There is no notation 
known to the author that is not awkward in some circumstances. 
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1980). The eigenstates and eigenvalues in the seesaw limit are 


2 
m m 
0 D 0c 0 D 
Pir wip = a Uns SE a 
S S 
MD o 0c Oc 
lon ~ Tie vr tV NVL, M2 ~ ms, (9.25) 
S 


with |mi| K mp for mr = 0. At energies low compared to mg the v2 decouples and 
one obtains an effective theory involving a single active Majorana vim ~ v? + Vf. 
The minimal seesaw mechanism is illustrated in Figure 9.3. 


(d) Pseudo-Dirac: this is a perturbation on the Dirac case, with mr, ms < mp. There 
is asmall lepton number violation, and a small splitting between the magnitudes of the 
mass eigenvalues. As an example, mp = €, mg = 0 leads to |m1 2| = mp + €/2. The 
pseudo-Dirac case is also sometimes encountered for the variant Dirac forms involving 
two active or two sterile neutrinos. 


(e) Mixing: The general case in which mp and mg (and/or mr) are both small and com- 
parable leads to non-degenerate Majorana mass eigenvalues and significant ordinary- 
sterile (v? — v°°) mixing, such as were suggested by the LSND and some subsequent 
results to be described below. Only this and the pseudo-Dirac cases allow such mix- 


ings. 


9.1.3 Extension to Two or More Families 


These considerations can be generalized to two or more families, or even to the case of 
different numbers of active and sterile neutrinos. For F = 3 families, define the three- 
component weak eigenstate vectors 


0 Oc 

o_ | oa o rae 

— oe E 
v =| vr, |, vie = Yat , (9.26) 

[a4 

KSE VSI, 


and similarly for v% , v% and the two-component fields. In the Dirac case, (9.5) generalizes 
to 


-p= (2? Mpv% + PRM but) = (NE MDNR $ NR MINE) l (9.27) 


where Mp is a completely arbitrary 3 x 3 matrix. Mp can be diagonalized by separate left 
and right unitary matrices AT p, just as in (8.47) on page 269, 


mı 0 0 
AŻ MpAk =(Mp)p=| 0 m 0 |, (9.28) 
0 0 m3 
with mass eigenstate fields 
v, = Ay, VR = Aing. (9.29) 


The leptonic part of the weak charge raising current is then 


JH? = 20r Veer, (9.30) 
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where 
Ve = Az! AG, (9.31) 


is the leptonic mixing matrix, analogous to the quark mixing matrix V} in (8.61). V = Vv; 
is also know as the PMNS matrix, after Maki, Nakagawa, and Sakata (Maki et al., 1962) 
and Pontecorvo (Pontecorvo, 1968). The counting of angles and phases in Vp is the same 
as for the CKM matrix (see Equation 8.62). One can choose the arbitrary phases in Aï% 
(i.e., the KY matrix analogous to (8.51)) to remove unobservable phases from Vy (leaving 3 
angles and one phase for F = 3), and then choose those in A% (i.e., A) to make the mass 
eigenvalues real and nonnegative. 

For F4 = 3 active and Fs > 3 sterile neutrinos (and only Dirac masses), three linear 
combinations of the sterile fields will join with the active ones to form three massive Dirac 
fields, leaving Fs — 3 massless Wey] fields. The reverse situation occurs for Fs < F4. 

In the Majorana case the triplet mass term (9.6) generalizes to 


1 1 
Lr = 5 (22 Mro% + 7x Mj? ) =5 (23 Mre% 4 vremi) ; (9.32) 
where Mr is symmetric, Mp = MÆ, by (9.20). This symmetric property holds for all 
Majorana mass matrices. The corresponding two-component expression is 


1 
-Lr = 5 (M Mri? NY — Nio? MİNG) . (9.38) 


Since Mr is symmetric, it can be diagonalized by a transformation analogous to (9.28), 
but with AY = Ai’ K, where as usual K is an undetermined diagonal phase matrix in the 
absence of degeneracies. We will choose K = J in order to maintain v{, = Cv, for the 
eigenstates (cf., Equation 9.17). In that case, the phases in A¥ are uniquely determined by 
the requirement that the mass eigenvalues m; are real and positive.’ Consequently, there is 
less freedom to remove phases from V; than in the Dirac case (or in the CKM matrix). The 
counting in (8.62) is modified in that one can only remove the F phases associated with the 
charged lepton fields from the F? parameters in a general unitary F x F matrix, so that 
there are F(F — 1)/2 mixing angles and F(F — 1)/2 observable CP-violating phases. The 
upshot is that Vọ can be written 
Ve = K% Vr, (9.34) 

where V; involves 3(F — 1)(F — 2) phases analogous to those in the CKM matrix, such as 
the single phase for F = 3 displayed in (8.214) on page 340. KX, is a diagonal matrix of 
Majorana phases. Only the F'—1 phase differences are observable, so one often takes it to be 
of the form KY, = diag(e~’™,--- ,e~*-1, 1). Since the Majorana phases only multiply the 
mass eigenstate fields, they do not enter any amplitude involving only external neutrinos 
or those involving an ordinary (lepton-number conserving) internal neutrino line. They do 
affect amplitudes involving lepton number violation, such as the 88, amplitude in Figure 
9.2. 

To summarize, for F = 3 families, the adjoint of the leptonic mixing matrix can be 
parametrized by 


1 0 0 C13 0 $13e7 t? C12 Sig 0 efor 0 0 
Vv; = 0 C23 S23 0 1 0 —S12 C2 0 0 e122 0 5 
0 —S93 C923 — s136? 0 C13 0 0 1 0 0 1 
(9.35) 


’The phase would be undetermined but unobservable for a zero eigenvalue. 
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where cij; = cos@;; and s,;; = sin 0;j and 6 are leptonic mixing angles and a C P-violating 
phase similar to (but numerically different from) the angles in the CKM matrix in (8.214), 
and aj,2 are Majorana phases. The same form holds for Dirac masses, except the last factor 
becomes the identity. Note that we are following the conventions in (Patrignani, 2016), and 
that (8.214) refers to V} while (9.35) refers to vi. To conform to standard notations, we 
will also define the mixing matrix 

v=v}. (9.36) 
It is often convenient to choose a basis for the lepton doublets in which the charged lepton 
mass matrix is already diagonalized, i.e., A$, = I. Then, V = Aj. 

In the general multi-family case, with both Dirac and Majorana masses, 


1, 9 oof Mr M v% 
-L=3( 7} 7} et Me) (CB ) tho, (9.37) 


where Mr = Mt and Ms = Mé. For F = 3 there are six Majorana mass eigenvalues and 
eigenvectors. The transformation to the mass eigenstate basis is 


0 
vy = Aut ( T ) l (9.38) 
VL 
where Ay is a 6 x 6 unitary matrix and vz is a six-component vector. The analogous 
transformation for the R fields involves A% = AT*K because the 6 x 6 Majorana mass 
matrix is symmetric. Our phase convention vh = C ZA again implies the choice K = I. 
Analogous to (9.25), the seesaw limit of (9.37) occurs when the three eigenvalues of Ms 
are all large compared to the elements of Mp and Mr (the latter is usually assumed to be 
zero). Then, one has 


A 0 
AŻ! = AZT = ( Lo Ja (9.39) 
0 aS 
where A¥ and A}® are 3 x 3 unitary matrices, J and 0 are, respectively, the 3 x 3 identity 
and zero matrices, and 
I —MpM;' 
Buin ( = a ) (9.40) 
L (Mz M} I 
One finds PRE 
vi ( Mr Mp \ pix, (Mr—-MpM5s'MB 0 
BY ( ME mM, ) P , Ms) (9.41) 


That is, there are three light (approximately) active neutrinos with an effective Majorana 
mass matrix Mr — MpM,;'Mj, diagonalized by A% = A%*. There are also three heavy 
(approximately) sterile neutrinos with mass matrix Mg, which is diagonalized by A{S = 
Ai”. The latter decouple at energies small compared with their masses. 

In the more general case in which one or more of the eigenvalues of Mg are not large, some 
or all of the light states will include non-negligible sterile components. In particular, the 
active and sterile neutrinos of the same chirality will mix significantly (as was suggested by 
the LSND and some other experiments) if both Majorana and Dirac masses are of the same 
order of magnitude or in the pseudo-Dirac case. Constructing models with these features 
presents a special challenge compared to the Dirac, Majorana, and seesaw cases because 
one must find an explanation as to why two different types of mass terms are small. If all 
three sterile states remain light, the leptonic mixing matrix in the charge raising current in 
(9.30) becomes 6 x 3 dimensional, i.e., 


V; = AX PAS, (9.42) 
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where AY is the 6 x 6 neutrino mixing matrix and P4 = (J 0) is the 3 x 6 dimensional 
matrix that projects onto the active neutrino subspace. One can easily generalize to the 
case of F4 = 3 active neutrinos and Fs sterile neutrinos, in which case there are 3 + Fs 
Majorana mass eigenvalues and P4 becomes a 3 x (3 + Fs) dimensional projection. 

Our original definition of a Dirac mass term as one that couples two distinct Weyl fields 
and a Majorana mass as one that couples a Weyl field to itself is not very useful in the 
multi-family case. In the above discussion we implicitly referred to couplings between active 
and sterile neutrinos as Dirac, and active-active or sterile-sterile couplings as Majorana. 
However, an alternate definition, which we will now adopt, is to define Majorana or Dirac 
masses on the basis of the form of the mass eigenvalues and eigenvectors. One can then 
view Majorana mass terms as the generic case, and reserve the term Dirac for special or 
limiting cases in which there is a conserved lepton number. We already saw an example of 
the Dirac limit of the one family mixed model, which trivially generalizes to the F family 
case when Mr = Ms = 0, Mp £ 0. 

A less obvious example involves the two family Majorana mass matrix 


0 MZKM 
Mr = ; 9.43 
= tees 0 ) oe) 


This has the same form as the Dirac limit of (9.18), except that in this case the L and R 
components are both active. Let us be even more explicit, and assume that (9.43) holds in 
the basis in which the charged leptons (e and p) are already diagonal. Then we can identify 
VÊ; = VeL, vIe = VSR, v; = VuL, and ee = VR, 50 that 

MZKM /_ 


-Pps 5 (DeLVER + DuLVSR + VE RVeL + VERY pL) 


= MZKM (Ver, + Dir) (Ver + var) : 


(9.44) 


This Zeldovich-Konopinski-Mahmoud model (Zeldovich, 1952; Konopinski and Mahmoud, 
1953) involves a Dirac neutrino, in that there is a conserved quantum number (Le — Ly, 
rather than L = Le + L,,) and because two distinct Weyl neutrinos are involved. However, 
in many ways it is more closely related to the Majorana case, i.e., it violates weak isospin 
by one unit and is a limiting case of the general 2 x 2 Majorana matrix. Any perturbation 
involving nonzero diagonal elements of Mr would break the “degeneracy” mı = —m2 of 
the two eigenvalues. A modern F = 3 version of the ZKM model involves the matrix 


0 1 1 
Mr = MZKM 1 0 0 5 (9.45) 
1 0 0 


which leads to one massive Dirac neutrino vez + aM r + VÊg), and one massless Weyl 
neutrino Fur — vL), with Le — L, — L, conserved. This actually yields a spectrum 
somewhat similar to the observed one, but would require nontrivial perturbations both in 
Mr and the charged lepton mixing to be fully realistic. An example of a perturbation on 
Mr leading to a generalization of the pseudo-Dirac case is considered in Problem 9.2. 

An analogous situation sometimes occurs (especially in complicated models in which 
Fs > F4) when two sterile neutrinos pair to form a Dirac neutrino, e.g., with Mg of a form 
analogous to (9.43). 

Let us conclude this section by reemphasizing that there is no distinction between Dirac 
and Majorana neutrinos except by their masses (or by new BSM interactions). As the masses 
go to zero, the active components reduce to standard active Weyl neutrinos in both cases. 
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There are additional sterile Weyl neutrinos in the massless limit of the Dirac case, but these 
decouple from the other particles. We also repeat the comment from Section 8.2.2 that one 
can ignore Vp in processes for which the neutrino masses are too small to be relevant. They 
are then effectively degenerate (with vanishing mass) and one can work in the weak basis. 


9.1.4 The Propagators for Majorana Fermions 


For free Dirac fields there is a conserved fermion number and therefore only a single type of 


propagator, given in (2.182) on page 38. For Majorana neutrinos, on the other hand, there 
is no conserved fermion number and there are three non-zero propagators, 
= 1 . dtk —ik-(x—zx') 
(OIT [vm (2), Pm (x")]|0) = i JE Sr (k) 
(27) 
T 5 1 . dtk —ik-(x—x') pt 
(0|7 [Pur (2), Pm (z )]l0) = i e C'Sr(k) (9.46) 
(27)4 
1 - dtk —ik-(x—-2’) 
(0|7 [vm (x), vm (x")]|0) = i e Sp(k)(—C), 


(27)* 


where Sp (k) is the usual poe. The first Majorana propagator is analogous to the Dirac 
case, while the second and third can be thought of as annihilating or creating two neutrinos, 
respectively. They can easily be derived from 


>= ui, s) o(p, 8)" = (Ø + m) (-C) 
j (9.47) 
X alp, s)" alp, s) = Ct (Ø — m) 


and two similar identities with u << v and m — —m, which in turn follow immediately from 
(2.177), (2.290), and (2.291). Similar expressions apply to the Majorana fields that occur 
in supersymmetry. 

Expressions for amplitudes involving the second and third terms in (9.46) take an unusual 
form, but they can usually be rendered more familiar by use of (2.290) and (2.291) on 
page 55, or by the use of (2.296) along with vm = 1{,. As a simple example, consider the 
process W- W- — e~ e~, assuming that the ve is Majorana and ignoring all family mixing 
effects. This proceeds via the diagrams in Figure 9.4, which form a critical part of those for 
BPov in Figure 9.2. The relevant WCC interaction is 


g = 
L= -Jy W; 9.48 
aii (9.48) 
where 

Jy = E (1 — 9? ur = E" (1 — 7° )vm, (9.49) 

since PvR = 0. The amplitude from the t-channel diagram is therefore 

= 5 = 5 Kk+mr 

m= (54 ig eela -alara = 7°)] 9 [a0] a 00 


where k = p3 — pi, mr is the Majorana neutrino mass, the propagator follows from the last 
expression in (9.46), and we have assumed |k?| > m2. This expression can be simplified 
using (2.289), (2.291), and (2.292), 


[ia (1 —7°)] p(—C)5a = - [A - 7 )y’u4] 5, (9.51) 
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so that j 
—ig —2imr\ _ Boy 
M; = | —— v | —— PL - , 9.52 
t (5) €1p€2 ( 2 ) ty (= 7°) va (9.52) 


which can be evaluated in the usual way. Note that the ū4 spinor has been replaced by v4. 
The & part of the propagator has dropped out because of the pure V — A interaction, but 
would be allowed if there were an admixture of V + A. The same result can be obtained 
more directly by rewriting 


Jw = ey” (1 — yï Wm = —0my” (1 + yee (9.53) 


obtained from (2.296) for one of the vertices. Then, the first propagator in (9.46) is the 
relevant one, and we must use a v spinor for ez since e° is the positron field. 


Figure 9.4 Diagrams for W`- W7 — ee” assuming a Majorana neutrino. Left: t- 
channel. Right: u-channel. The cross represents a Majorana mass insertion. 


9.2 — EXPERIMENTS AND OBSERVATIONS 


In this section we describe the principal laboratory and astrophysical constraints on the 
number of light active and sterile neutrinos, and their masses and mixings. The constraints 
on possible heavy Dirac or Majorana neutrinos and on neutrino decays® are reviewed in (Raf- 
felt, 1999; Mohapatra and Pal, 2004; Atre et al., 2009; Giunti and Studenikin, 2015; Faessler 
et al., 2014; Drewes and Garbrecht, 2015; de Gouvéa and Kobach, 2016). 


9.2.1 Neutrino Counting 


As discussed on page 309, the width for Z to decay invisibly implies that there are only three 
active neutrinos with masses < Mz/2. More precisely, Ni”” = 2.992 + 0.007 from the global 
fit to precision data? (Patrignani, 2016), where Ni”” does not include sterile neutrinos or 
very heavy active neutrinos, but does include the effects of other possible invisible decay 
channels asssociated with new physics. Precision constraints also exclude additional heavy 


8 Radiative decays, such as v2 > eten, if kinematically allowed, or v2 — vıy, are possible in the SM. 
The latter is loop suppressed and extremely slow. BSM physics could allow faster v2 — vıy or invisible 
decays, such as v2 > 141171, V2 > vı F, or v2 + D1 M, where F is a familon (a Goldstone boson associated 
with a hypothetical broken family symmetry), and M is a Majoron (a Goldstone boson associated with a 
spontaneously broken lepton number). 

There is also a more direct determination of the invisible width from e~e+ — y+ invisible, yielding 
Nine = 2.92 + 0.05. 
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active neutrinos if they belong to a complete degenerate chiral family, but may be evaded, 
e.g., for non-chiral doublets or nondegenerate families. 

Another major constraint comes from big bang nucleosynthesis (BBN) (e.g., Kolb and 
Turner, 1990; Dolgov, 2002; Steigman, 2012; Cyburt et al., 2016; Patrignani, 2016), which 
has also been a critical test of hot big bang cosmology and of many possible types of 
nonstandard particle physics. The basic point is that the reactions 


n+Vve e p+e, nt+et opti (9.54) 

: f Pee En—E 
kept the ratio of neutrons to protons in thermal equilibrium p = exp(——";-*) ~ 
exp(— 72) in the early universe as long as the reaction rate T ~ GZT° was larger 


than the expansion rate (Hubble parameter) H ~ 1.66,/g, T? /Mp, where Mp is the Planck 
mass. H? is proportional to the energy density p = g.7°T*/30, where gs = gp + {gr and 
gB,F are the number of relativistic bosonic and fermionic degrees of freedom in equilib- 
rium at temperature T. The equilibrium was maintained until the freezeout temperature 
Ty ~ (VGx/G%Mp)'/? = O(few MeV) when I ~ H, at which time the n/p ratio was frozen 


Mn—Mp 


at the value exp(— 


) except for neutron decay, and most of the neutrons were eventu- 


ally incorporated into “He. By apparent coincidence T} is close to Mm, — Mp, so the expected 
abundance depends sensitively on gą. In the SM, one expects gẹ = 43/4 for Mme < T < mp, 
(gg = 2 from two photon helicities and gr = 10 from 3(vz, + vh) + two helicities each of 
e+). This leads to the prediction that the ratio of primordial 4He to H by mass should 
be ~24%, in agreement with observations.'!? However, any additional contribution to the 
energy density for T = few MeV would increase H and T;, and therefore the predicted 
helium abundance. This can be parametrized by writing 


gr =4+2NBBN (9.55) 


where any deviation of the effective N??% from'! 3.046 could indicate new light degrees of 
freedom in (partial) equilibrium, or such effects as neutrino masses of O(MeV) or neutrino 
decay. There has long been some uncertainty and controversy in the observational primordial 
abundance, and therefore the limits on N??%. Recent estimates include NB?N = 3.7 + 
0.5 (Steigman, 2012) and N28" = 2.9+0.3 (Cyburt et al., 2016). ANPPBN = NPPN _ 3.046 
constrains not only additional active neutrinos with masses < 1 MeV, but also light sterile 
neutrinos of the type suggested by the LSND experiment, which could be produced by 
mixing with active neutrinos for a wide range of parameters (e.g., Dolgov, 2002; Cirelli 
et al., 2005; Hannestad et al., 2012). However, ANPPN does not include the sterile vz 
components of light Dirac neutrinos, which could not (for the currently relevant mass ranges) 
have been produced in equilibrium numbers unless they have new BSM interactions or 
properties (Barger et al., 2003b; Anchordoqui et al., 2013). Of course, other light BSM 
particles besides sterile neutrinos could contribute to ANP? , such as (pseudo-) Goldstone 
bosons associated with some new symmetry (Weinberg, 2013). 

Most new physics effects increase the predicted +He abundance, leading to a more strin- 
gent upper limit on AN??%. One important exception is a possible large asymmetry be- 
tween v and P, which would preferentially drive the reactions in (9.54) to the right, de- 
creasing the n/p ratio and allowing a larger AN,P?%.. (Only the ve — De asymmetry directly 
affects the reactions, but the observed neutrino mixing would probably have equilibrated 


10There is also a weak dependence on the baryon density relative to photons, which is determined inde- 
pendently by the D abundance and by the cosmic microwave background anisotropies. 

11The standard model value for N,P?% differs slightly from 3 due to such effects as non-instantaneous 
neutrino decoupling. 
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the asymmetries between the families.) However, such an asymmetry would have to be enor- 
mous, (ny — np)/ny ~ O(0.1), compared to the baryon or charged lepton asymmetries to 
have much effect. Even allowing it, the present constraints from BBN along with the cosmic 
microwave background (CMB) and large scale structure data imply that such asymmetries 
would not significantly perturb the constraints on ANP? (Simha and Steigman, 2008). 
A large asymmetry could lead to important nonlinear effects in the case of active-sterile 
neutrino mixing, however (Foot et al., 1996). 

There are also stringent constraints on the number of neutrinos and their masses from 
the CMB and from the distribution of galaxies (e.g., Wong, 2011; Lesgourgues and Pastor, 
2012; Patrignani, 2016). For example, the CMB anisotropies depend on the number of 
relativistic degrees of freedom that were present at recombination, when the universe had 
cooled sufficiently (to T ~ 0.26 eV, or redshift z ~ 1100) for neutral atoms to form so that 
the photons decoupled from matter. The WMAP (Hinshaw et al., 2013) and Planck (Ade 
et al., 2016) collaborations have made very detailed studies of the CMB. When combined 
with galaxy distributions, the Planck analysis obtains NO’? = 3.15 + 0.23 for the number 
of neutrinos (active and sterile, weighted by their abundance), as well as other BSM forms of 
dark radiation, that were relativistic at recombination.!? This is consistent with 3.046, but 
also allows one or more additional species if they have a reduced abundance. The effects of 
and constraints on active or sterile neutrino masses are discussed in the the next subsection. 


9.2.2 Neutrino Mass Constraints 


A stringent kinematic limit on the effective ve mass-squared 


mi, =X Vei? m? (9.56) 


can be obtained from the shape of the e~ spectrum near the endpoint in tritium ( decay, 
3H — 3He ex (e.g., Otten and Weinheimer, 2008; Dragoun and Vénos, 2016). In (9.56) 
mi is the it” mass eigenvalue, independent of whether it is Dirac or Majorana, and V is the 
leptonic mixing matrix as defined in (9.36). The current limits from experiments in Troitsk 
and Mainz are respectively m,, < 2.05 eV and < 2.3 eV at 95% c.l. The Karlsruhe KATRIN 
experiment should improve the sensitivity on m,, down to around 0.2 eV. KATRIN and 
proposals for future experiments are reviewed in (Drexlin et al., 2013). 

The kinematic limits on the v, and v, masses, defined analogously to (9.56), are 
much weaker: m,, < 0.19 MeV from mt > pir iy and my, < 18.2 MeV from 77 —> 
3r*v,,57*(2°)v,. These bounds are now superseded by much more stringent ones from 
neutrino oscillations and cosmology. However, it is historically interesting that the com- 
bination of this bound on m,, from ALEPH with the BBN constraint (which becomes 
relevant because an ~ (1 — 20) MeV neutrino contributes more than 1 to N??) excluded 
the possibility of a stable or long-lived v, above 1 MeV (e.g., Fields et al., 1997). 

We also mention the historically important observation of a burst of O(20) neutrinos 
(presumably mainly De’s) from the core-collapse Supernova 1987A by the Kamiokande, IMB, 
and Baksan collaborations, which implied (amongst many other things!) that my, < 20 eV, 


12The various cosmological limits on the number and masses of neutrinos depend somewhat on the data 
set chosen and on possible correlations with other parameters. 

13For example, the neutrinos arrived within a few hours of the supernova photons, whereas they would 
have arrived ~ 5 months sooner (after a 160,000 year journey) if they did not share the same gravitational 
interactions, testing the weak equivalence principle (Longo, 1988; Krauss and Tremaine, 1988). 
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because otherwise the arrival times of the detected neutrinos would have spread out more 
than was observed. It was hard to make the limit precise, however, because it depended 
on theoretical details of the neutrino emission (e.g., Bahcall, 1989; Raffelt, 1999). Core- 
collapse supernovae are expected to occur in our galaxy at the rate of several per century. 
Observation of the v’s from such a supernova in neutrino and other detectors would yield 
a wealth of information on neutrino properties as well as on the dynamics of the super- 
nova explosion (e.g., Duan et al., 2010; Scholberg, 2012; Mirizzi et al., 2016). The diffuse 
background flux of neutrinos from core-collapse supernovae in other galaxies may also be 
observable (Beacom, 2010). 

Light massive neutrinos would contribute to the cosmological energy density, and they 
would close the Universe, Q, = 1, for X ~ 35 eV, where 


L= >, Im;l (9.57) 


is the sum of the masses of the light active neutrinos. Observationally, some 26-27% of 
the energy density is dark matter (Patrignani, 2016), but it is most likely cold dark mat- 
ter (CDM), which was non-relativistic at decoupling, such as weakly interacting massive 
particles (WIMPs) or axions. Light neutrinos would be hot dark matter (relativistic at 
decoupling), which would free-stream away from density perturbations, preventing the for- 
mation of the observed smaller scale structures during the lifetime of the universe.!4 Smaller 
neutrino masses (close to the recombination temperature ~ 0.26 eV) would lead to subtle 
effects in the CMB and galaxy distributions (e.g., Wong, 2011; Abazajian et al., 2011; 
Lesgourgues and Pastor, 2012). For example, the Planck collaboration (Ade et al., 2016) 
finds 

£ < 0.23 eV at 95% c.l. (9.58) 


for the sum of the active neutrino masses (assuming NOMB = 3.046) from the combination 
of CMB and galaxy data, while some other recent data sets yield tighter or weaker lim- 
its (e.g., Abazajian and Kaplinghat, 2016). Future cosmological observations should be able 
to extend the sensitivity to © down to or below the minimum value 0.05 eV ~ \/|Am2,,,.| 
allowed by the neutrino oscillation data (Abazajian et al., 2015). 

Allowing for light sterile neutrinos as well one must take into account the extra con- 
tribution to the radiation. Furthermore, the cosmological observables depend on how the 
masses are distributed amongst the states, possible asymmetries, and possible non-thermal 
production. The Planck collaboration considered the example of one light (ms < 10 eV) 
thermally-produced sterile neutrino. In that case, ANOMB = (Ts/T,)*, where Tg and T, 
are, respectively, the sterile and active neutrino temperatures (e.g., Kolb and Turner, 1990). 
(One expects Ts < T, due to earlier decoupling.) Assuming also that © = 0.06 eV and no 
asymmetries, they obtained the correlated 95% c.l. limits 


NOME 235 mal? 2052 eV, (9.59) 


where the effective mass me f is mg weighted by the sterile abundance, i.e., me f= 


(Ts tT)? mg. 


14Warm dark matter, e.g., from keV mass sterile neutrinos, is an intermediate possibility. Theoretical, 
astrophysical, and cosmological implications of intermediate mass sterile neutrinos are reviewed in (Kusenko, 
2009; Merle, 2013; Adhikari et al., 2017). An example, the neutrino minimal standard model (vMSM), is 
reviewed in (Boyarsky et al., 2009). 
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9.2.3 Neutrinoless Double Beta Decay 


Majorana masses can lead to Bov, i.e., nn — ppe~ e~, which violates lepton number by two 
units, by the diagrams in Figure 9.2 (for reviews, see Rodejohann, 2011; Vergados et al., 
2012; Bilenky and Giunti, 2015; Pas and Rodejohann, 2015; Dell’Oro et al., 2016). Since 
there is no missing energy,’? events should show up as a peak of known energy in the 
ee spectrum from a sample of 86 ,-unstable nuclei. However, the process would have an 
extremely long half-life, so problems of backgrounds are severe. 

The amplitude for BSo, is M ~ Anue™mgs, where Anuc contains the nuclear matrix ele- 
ment. Anuc cannot be directly measured and therefore introduces considerable uncertainty 
into the interpretation of any upper limit or future observation (Vergados et al., 2012; Vogel, 
2012; Simkovic et al., 2013). mgg is the effective Majorana mass in the presence of mixing 
between light Majorana neutrinos, 


mgg = oa)? (9.60) 


It is just the (e,e) element of mr or of the effective Majorana mass matrix in a seesaw 
model (i.e., the (1,1) element in the family basis in which A¢ = J). It involves the square 
of Vei rather than the absolute square, allowing for the possibility of cancellations between 
terms. Such cancellations could occur even if the original mass matrix were real because 
some of the eigenvalues could be negative. (In our phase convention the m; are taken to be 
positive, but the signs would appear because some of the Majorana phases in (9.35) would 
then be +i.) This also shows why the 86o amplitude vanishes for a Dirac neutrino, which 
can be viewed as two Majorana neutrinos that give equal and opposite contributions. The 
cancellations could in principle allow the determination of (C P-violating) Majorana phases 
different from 0 or +i, though this is difficult in practice because the other parameters 
including the matrix elements would have to be known rather well (Barger et al., 2002; 
Pascoli et al., 2002). 

There are several precise limits on 66o, in various nuclei,!® including KamLAND-ZEN 
and EXO-200 [13°Xe], GERDA [Ge], and CUORE-O [!°°Te] (for a review, see Ostrovskiy 
and O’Sullivan, 2016). A combination of these yields mgg < (0.13—0.31) eV at 90% c.l. (Gu- 
zowski et al., 2015), with the range due to the nuclear matrix element uncertainties. The 
most recent KamLAND-ZEN result (Gando et al., 2016) (not included in the combination) 
obtains a 90% c.l. lower limit of 1.1 x 107° yr on the °6Xe Bbo, half-life. This corresponds 
to mgg < (0.06 — 0.17) eV, very close to the range expected for the inverted hierarchy 
described in Section 9.4. Future experiments should be sensitive down to O(0.01 — 0.02 eV) 
or better (e.g., Ostrovskiy and O’Sullivan, 2016). They should be sufficient to observe Bov 
if the neutrinos are Majorana with masses corresponding to the inverted or degenerate spec- 
tra, but no scheduled experiment would be sensitive to the normal hierarchy. (See Biller, 
2013, however.) 

A heavy Majorana neutrino could also contribute to mgg, but its contribution would be 


15Two-neutrino double beta decay, 882v, is the process nn —> ppe~e~ vi, which can occur by ordinary 
second-order weak processes in some -stable nuclei. It leads to a continuous e` e~ spectrum and has been 
studied in a number of nuclei (Saakyan, 2013). It is helpful for testing calculations of the nuclear matrix 
elements entering 889,. A third possibiity is Majoron decay, nn > ppe~e~ M or BGovm, where M is a 
Majoron (Goldstone boson). It would lead to a spectrum intermediate between Bboy and BB2v. 

16 An observation of 8Go, in “Ge with a half-life ~ 2 x 1025 yr has been claimed (Klapdor-Kleingrothaus 
and Krivosheina, 2006) by members of the Heidelberg-Moscow experiment. This would correspond to mgg ~ 
(0.25—0.60) eV. However, the result has not been confirmed by other experiments, and is apparently excluded 
by the 136 Xe results for plausible nuclear matrix elements. 


390 The Standard Model and Beyond 


suppressed by finite range effects (i-e., the m? in the denominator of the propagator would 
be important), 


mgg > [S Vei) m, F (mi, A) ; F(m;, A) = ET (9.61) 


where A is the nucleon number. F(m;, A) is ~ 1 for m; < 10 MeV, but falls rapidly 
for larger values (e.g., Vergados et al., 2012; Faessler et al., 2014). Lepton-number violating 
effects other than Majorana neutrino masses could also lead to 88o, (although the existence 
of a Bo, amplitude implies the existence of a Majorana mass at some level (Schechter and 
Valle, 1982)). For example, models involving both V — A and V + A interactions as well as 
lepton number violation can induce 389, amplitudes not directly proportional to neutrino 
masses (Problem 9.3). There could also be effects from new interactions such as leptoquarks 
or R-parity violation in supersymmetry. If 8oy is observed, it would be useful to study it 
in several different nuclei, both to help control nuclear matrix element uncertainties and to 
shed some light on the underlying mechanism. 


9.2.4 Relic Neutrinos 


The BBN and CMB constraints on the cosmological neutrinos are indirect. Following their 
decoupling at T ~ few MeV the neutrino wavelengths were redshifted so that their mo- 
mentum distribution should at present have a thermal form, characterized by an effective 
temperature T, = (4/11)! T} ~ 1.9K, where Ty ~ 2.73 K is the CMB temperature and the 
(4/11)'/8 factor is because the y’s but not the neutrinos were reheated by e~ e+ annihilation 
at T Sm, (Steigman, 1979; Kolb and Turner, 1990; Weinberg, 2008). This corresponds to 
a number density of ~ 50/cm? for each neutrino degree of freedom, i.e., ~ 300/cm® for 3 
flavors with two helicity states. Local clustering and modifications of the momentum dis- 
tribution are not expected to be large unless the masses are = 0.1 eV (e.g., Ringwald and 
Wong, 2004). Direct detection of these relic neutrinos appears extremely difficult. Effects 
involving macroscopic torques or forces (Stodolsky, 1975; Cabibbo and Maiani, 1982; Lan- 
gacker et al., 1983) are tiny. Another possibility are Z bursts, in which ultra high energy 
cosmic ray neutrinos annihilate on relic neutrinos to produce Z’s (Weiler, 1982; Eberle et al., 
2004), which could be observed through their decay products or as absorption dips in the 
cosmic ray v spectrum. However, this would only be feasible if there were some unexpected 
intense source of such high energy neutrinos. More promising are v-induced e* emission 
by nuclei (e.g., Weinberg, 1962; Cocco et al., 2007; Lazauskas et al., 2008), which would 
show up as e~ emission above the 8 decay endpoint. The expected rate for KATRIN is 
not encouraging (Faessler et al., 2017). However, the recent PTOLEMY proposal for a very 
large (100 g) tritium source deposited on a graphene substrate (Betts et al., 2013) might 
be sensitive to relic neutrinos for a degenerate spectrum with m; = 0.1 eV, with a larger 
rate for Majorana than Dirac neutrinos (Long et al., 2014). Such experiments would also 
have sensitivity to heavier sterile neutrinos and to v — P asymmetries. For reviews of relic 
neutrinos, see (Gelmini, 2005; Strumia and Vissani, 2006; Ringwald, 2009). 


9.2.5 Electromagnetic Form Factors 


Neutrinos have no electric charge, but they can acquire magnetic and electric dipole mo- 
ments by diagrams analogous to the weak corrections to the muon magnetic moment in 
Figure 2.20 on page 72 or from new physics. These lead to effective electromagnetic inter- 
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actions 


i ' 
Hrug = gtin” MiP a uT (9.62) 


where n; can represent either a Dirac (vip) or Majorana (vim) mass eigenstate field. The 
first (second) terms are magnetic (electric) dipole interactions, as can be seen from (2.358), 
(8.324), and Problem 2.30. The flavor-diagonal terms i = j are known as direct or intrinsic 
moments, while those for i 4 j are transition moments. 
The magnetic and electric dipoles flip chirality. In the Majorana case, vim = ViL + VER, 
so that 
Dimo” vim = DiLOoÄ” Vip + Diro” viL. (9.63) 


A ec cP 
However, using Vfp = Ci;,, 


DjLO” ViR = veRCo CoR, = —DiLo"” Vip, (9.64) 
so that uij = —uji. (This can also be seen from (2.303) on page 56.) Majorana neutrinos 
therefore cannot have direct magnetic moments, but can have transition moments pij 4 0 
for i # j, which can mediate decays such as vim —> vimy. Similar statements apply to 
electric dipole moments. 

Both direct and transition moments are possible for Dirac neutrinos vip = ViL + ViR- 
For a single flavor 
Dpo’ vp = Dro” vR + DRO” VL. (9.65) 


One can write vz and vp in terms of two degenerate Majorana neutrinos using (9.23), 


1 1 
VL = gp” +L), VR = Jair —V3R), (9.66) 
where the unnecessary superscript 0 has been dropped. Then 


1 1 
— wv — = HVC — ae 
DLO" VR = —— yo" V5 p + 20" VIR. 9.67 
v2 2R V2 1R ( ) 
That is, a direct Dirac magnetic (or electric) moment is an antisymmetric combination of 
transition moments between degenerate Majorana states. 
In the simplest extension of the SM with a small Dirac mass m;, the direct neutrino 
magnetic moment is (Marciano and Sanda, 1977; Lee and Shrock, 1977) 


; 3eG pm; 
me 8/212 


where upg is the Bohr magneton. This is negligibly small compared with laboratory limits 
< few x10~' pp (Vogel and Engel, 1989; Giunti and Studenikin, 2015; Giunti et al., 2016), 
and various astrophysical limits, e.g., from stellar cooling, uy < few x 10~!* (Raffelt, 1999; 
Giunti and Studenikin, 2015; Giunti et al., 2016). The latter often applies to electric dipole 
moments as well, and to both Dirac and Majorana transition moments. 

One can construct models with magnetic moments that are much larger than (9.68), as 
was motivated by an alternative solution to the solar neutrino problem involving resonant 
spin-flavor precession in an assumed strong solar magnetic field (Akhmedoy, 1988; Lim 
and Marciano, 1988). However, there is a limit as to how large they can be. For Dirac 
neutrinos, higher-dimensional operators that can generate a magnetic dipole moment u? 
also contribute to m, (Bell et al., 2005). If the operators are generated by new physics at a 
scale > 1 TeV then there is an upper limit on u? of around 10715 upg for my < 0.3 eV. The 


~ 3.2 x 10712 (=) TA (9.68) 
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corresponding limits in the Majorana case are much weaker because of Yukawa suppressions 
to the Majorana mass (Bell et al., 2006). Thus, observation of a dipole moment above ~ 
10715 upg would imply that the mass is Majorana (or that there are fine-tuned cancellations 
between contributions to m,). 


9.3 NEUTRINO OSCILLATIONS 


9.3.1 Oscillations in Vacuum 


Neutrino oscillations are analogous to the neutral K and B meson oscillations described in 
Sections 8.6.3 and 8.6.4, and occur due to the mismatch between weak and mass eigenstates. 
They do not mix the neutrino helicities, and are therefore independent of whether the masses 
are Majorana or Dirac. First consider two neutrino flavors, ve and v,,, related to the mass 
eigenstates by 


|v.) = |v) cos 0 + |v2) sin 0, |V.) = —|r1) sin 0 + |v2) cos 0, (9.69) 


where 0, which corresponds to 612 in (9.35), is the neutrino mixing angle. Suppose that one 
starts at time t = 0 with a pure state |v(0)) = |v,,) of definite momentum" |p| from the 
decay 7+ + utv. The two mass eigenstate components each develop with their own time 
dependence, so that 


|v(t)) = —|v1) sin be *1! + |v) cos Oe *”2* 


met .m2t ‘ 9.70 
~ |—|vi) sin bet? + |v) cose" | es, ey 


at a later time t, where we have assumed that the neutrinos are extremely relativistic, so 
that E; = /|\p|? +m? ~ E+ m?/2E where E ~ |p|. After traveling a distance L there is 
a probability 


mt : 
P,,,-»v.(L) = |(velv(t)) [2 = sin? 6 cos? 6 j-e peti 


(9.71) 


Am2L 
= sin? 26 sin? (AS) = sin? 26 sin? | 


1.27Am? (eV?) L(km) 
E(GeV) 


for the neutrino to have oscillated into a ve, where Am? = m? — m? and L ~ t. The ve 
could be detected, e.g., by the reaction ven —> e~ p. The oscillation length is defined as 


(9.72) 


Such vacuum oscillations depend only on |Am?| and not on the absolute mass scale or on the 
hierarchy (which mass is larger). In appearance experiments one searches for the production 
of a different neutrino flavor than one started from, such as by the production of an e~ or 
T~ in an initial v, beam. In an idealized experiment with precisely known L and E for each 
event one observes not only the appearance of the new flavor, but the characteristic L/E 


17This simple approximation yields the correct result. However, a complete treatment requires consider- 
ation of the coherence of the initial and final wave packets, the relation between t and the location L of the 
detector, entanglement with other particles, etc. For recent discussions, see, e.g., (Akhmedov and Smirnov, 
2009; Cohen et al., 2009; Kayser et al., 2010). 
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dependence. However, for large Am?L/E the oscillations are averaged in practice by the 
spread and uncertainties in neutrino energy F and by finite detector/source sizes, so that 


1 
Pove (L) > sin? 20, (9.73) 


which is the same result one would have obtained from an incoherent superposition of nı 
and v2. In a disappearance experiment, one searches for the reduction in the flux of the 
initial v, (or other flavor) as a function of L and E, making use of the survival probability 
Pov, (L) = 1 — P,,-+,(L) for the state to remain a v,. For both types of experiment, 
careful attention has to be paid to the initial flux and spectrum (obtained from other 
measurements, theory, or by an initial calibration detector) and to backgrounds. 

Even with more than two types of neutrino, it is sometimes a good approximation to 
use the two-neutrino formalism in the analysis of a given experiment, e.g., if some of the 
mixings are small or if some of the Aj; = m? — m? are small compared to E/L (see below), 
and most results are presented in terms of allowed or excluded regions in the sin? 29 — Am? 
plane or the tan? 0 — Am? plane,'® whether or not that is really valid. However, a more 
precise or general analysis should take all three neutrinos into account. It is straightforward 
to show that the oscillation probability for Va —> 1 after a distance L is 


Py (L) =dab — 45 Re (Vii Voi Vaj Vij) sin? (3) 


i<j 4E 
í hese (9.74) 
+ p2 Sm (Vý: Voi Vaj Voz) sin ( ) ‘ 


where Va and m are weak (flavor) eigenstates and V is the leptonic mixing matrix in (9.35). 
For antineutrinos, P5,-+5,(L) is given by the same formula, except the sign of the last term is 
reversed.' It is apparent from (9.74) that P,,-,,,(Z) is the same as P,, ;,,(L) except that 
V — V*, and that any difference between them is due to CP-violating phases in V. The 
combination V7;VeiVajV;; is a Jarlskog invariant, i.e., independent of phase conventions. 
The Majorana phases do not enter, so C'P-violation in neutrino oscillations requires mixing 
between at least 3 families, just as in the CKM matrix. For F = 3, it is given by the 
phase 6 in (9.35), and all CP-violating effects would vanish for sı3 = 0. In practice, it 
is extremely difficult to compare P,,_,,,(Z) and P,,-4,,(L) directly, because, e.g., De are 
mainly produced at reactors, and v,,(V,,) at accelerators. However, CPT, which is built into 
the expressions above, implies that P,,4v,(L) = Ps,-+5,(L). Thus, 


Poon (L) —— Po, 0, (L). (9.75) 
yvyVr 


The oscillation rates for Va vs Da can be compared, e.g., by using v, (Du) beams from 
nt > pty, ( TT > WD), and any difference must be due to leptonic CP violation. 
From (9.74) and (9.75) the survival probabilities in vacuum must be equal, Pp >v, (L) = 
P;,-30,(L); CP violation in vacuum can therefore only occur in appearance experiments. 
The discussion above assumes the validity of CPT. Possible CPT violation in the neutrino 
sector is discussed in (Diaz and Kostelecky, 2012). 


18One usually labels the mass eigenstates so that Am? > 0. The cases 0 < 0 < 7/4 and 7/4 < 0 < 1/2 
are physically different. However, they cannot be distinguished by vacuum oscillation experiments, so it was 
traditional to use the variable sin? 20 to describe the results. Matter effects, however, can distinguish the 
two cases (de Gouvêa et al., 2000), so it is better to use tan? 0 instead. The region tan? 0 > 1 is sometimes 
known as the “dark side”. The cases 0 < or > 7/4 also differ by subleading effects for more than two flavors. 

19), and iq are, respectively, the left- and right-chiral states annihilated by vaz and VER: 
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Oscillations between active neutrinos of different flavors are known as first class (flavor) 
oscillations. The results in (9.74) and (9.75) generalize to second class oscillations (Barger 
et al., 1980a) between light active and sterile neutrinos of the same helicity, which can oc- 
cur when there are both Majorana and Dirac mass terms. For example, mixing between 3 
active and 3 sterile neutrinos would still be described by (9.74), except V is now a 6 x 6 
unitary matrix. Other phenomena can sometimes mimic neutrino oscillations, including non- 
standard interactions in the source, detector, or matter (e.g., Gavela et al., 2009; Ohlsson, 
2013); neutrino-antineutrino transitions (involving new interactions to flip helicity) (Lan- 
gacker and Wang, 1998); and massless neutrinos that are non-orthogonal due to mixing 
with heavy states (Langacker and London, 1988a). The latter can be generalized to massive 
oscillating neutrinos with an effectively non-unitary mixing matrix, again due to neglecting 
the mixing with heavier neutrinos (Antusch et al., 2006; Antusch and Fischer, 2014). 

An important special case of (9.74) occurs when the mass eigenstates can be divided 
into two sets, each of which is nearly degenerate compared to the E/L of the experiment. 
That is, consider F mass eigenstates, in which v;, i = 1---n, are close in mass, as are 
vj, j =n+1---F. If L/E is such that Ay L/E can be neglected when k and l are in the 
same set, then it is straightforward to show that 


Am2L 
P,,-sv,(L) = Po, >n, (L) = sin? 20,5 sin? ( a ) (9.76) 
for a Æ b, where 
n 2 F z 
sin? 20a, = 4| X Vý Voi) =4| XO ViVo; , (9.77) 
i=1 j=n+1 


and A = m? — mọ for i < n < j. The last form in (9.77) follows from unitarity. Thus, if one 
can neglect all but one mass splitting the two-neutrino formula holds, although the effective 
mixing angle may be a complicated function of the elements of V. In particular, there are 
no C P-violating effects in this limit. Similarly, the survival probabilities are given by 


(9.78) 


Am?L 
Pou, (L) = Poop, (L) = 1 — sin? 20aa sin? ( - ); 


4E 


with 


sin’ 20a = )_ sin? 20. =4 5 [Vai]? (: -> vaf’) , (9.79) 
1 k=1 


bA4a i= 


which implies 


P n 
sin? baa =E XO Waj? 51- Y Vail. (9.80) 
j=n+1 i=1 


Examples will be given below for oscillations involving atmospheric and sterile neutrinos. 


9.3.2 The Mikheyev-Smirnov-Wolfenstein (MSW) Effect 


Equation (9.70) or its generalization to three or more flavors describes the time evolution of 
an (initial) weak eigenstate in vacuum. However, for propagation through matter, such as the 
Sun or Earth, one must take into account the phase changes associated with the coherent 
forward scattering of the neutrinos with the matter, very much like index of refraction 
effects in optics (Wolfenstein, 1978). Under appropriate conditions, the matter effects can 


Neutrino Mass and Mixing 395 


combine with the neutrino masses to yield an effective degeneracy and therefore an enhanced 
transition probability, the MSW resonance (Mikheyev and Smirnov, 1985). 
To see how this works, consider the time evolution of a neutrino state 


=>) ala), (9.81) 


in vacuum. The weak eigenstates |v,,) are related to the mass eigenstates |v;) by 


Ya) = X |v) (Vi |Va) = 2 iri). (9.82) 


(In (9.82) and the following, we ignore the momentum integrals and delta functions, as well 
as the (27)°2E normalization factors, all of which cancel between the matrix elements and 


the sums over intermediate states.) Evolving the mass eigenstates analogously to (9.70), 
one finds that the coefficients satisfy the Schrodinger-like equation 


i—Ca(t) = (Va|Hv |r) or(t), (9.83) 


where the vacuum Hamiltonian (due to the masses) is 
(ValHv |v) = X Vai Bibs ~ > Vai x mi E Vi + Boban. (9.84) 


The last term only affects the irrelevant overall phase of the state. This and similar multiples 
of the identity can be dropped in what follows. For two flavors, Hy becomes 


Am? a cos20 sin 


Hv = 4E sin20 cos20 


(9.85) 


using the conventions in (9.69), with Am? = m3 — m7. 
In the presence of matter, (9.83) must be modified by the addition of the effective 


(matter) Hamiltonian 


2G 
Hm = pez iy: o Dar Y" VaL (pr Yulo E gA zi 5Y), (9.86) 


r 


which describes the scattering of va from fermions r = e, p,n, where g{/, are the effective 
vector and axial couplings, which receive contributions from Z exchange and (in the case of 
r =e) from W exchange, as in (8.93) on page 280. The brackets on the last term indicate 
an expectation value in the static medium. Assuming the medium is unpolarized, 


(pr Yul = 947° Jhr) = WW nÒ, (9.87) 


where n, is the number density of particle r (cf., Equation A.19 on page 509). Using also 
that f Zvl, Vaz is the va number operator, N,,, — Nve,, we obtain the propagation 


equation”? 


iZ cult) = (val Hy |v») c(t) + V2GF ( 5 gy ne) Ca(t). (9.88) 


r=e,p,n 


20The correct numerical factor and sign for the matter term were derived in (Barger et al., 1980b) 
and (Langacker et al., 1983), respectively. 
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The vector coefficients are 


1 1 1 
WV = +5 + 2sin” Ow, gy = +5 — 2sin” Ow, v= 
1 1 1 
= =F 2 sin? Ow, gf = Ra — 2sin° Oy, gy =--> (9.89) 
gy =0, 
for VeL, Vu,rL, and sterile vsz, respectively. The signs are reversed for vh, as can be seen 


from the number operator or from (2.302) on page 56. g{f contains an extra +1 from the 
WCC, which makes the effect important for ve © Vu,r- 
Specializing to two families, 


Am? G Am? .: 
—2 cos 20 + En am sin 20 
af elt NL A aa ca(t) (9.90) 
dt c(t) Ree 20 Am2 529 — Ge c(t) ? 
TE sin TE cos — Va 
where 
n= 5 (gf - gr) Nr, (9.91) 
r=e,p,n 


and we have symmetrized the diagonal elements by subtracting the common term 
V2Gr n/2. For an electrically neutral medium, i.e., ne = Np, this yields 


Ne for VeL © VuL; Urb 
n= Ne — inn for VeL + VL ; (9.92) 
1 
—5Nn for VuL, VrL ®© VsL 


with the signs reversed for vp. 

Under the right conditions, the matter effect can greatly enhance the transitions. In 
particular, if the Mikheyev-Smirnov-Wolfenstein (MSW) resonance condition Am cos 20 = 
V2G rn is satisfied, the diagonal elements vanish and even small vacuum mixing angles 
lead to a maximal effective mixing angle. Because of the sign switch, an enhancement for 
vr, corresponds to a suppression for vh and vice-versa (i.e., the presence of matter effectively 
breaks CPT). The matter effect breaks the sign degeneracy for vacuum oscillations, and 
allows a determination of the sign of Am?. Because of the Æ dependence it can lead to 
a distortion in the final neutrino spectra. Finally, if the matter density varies significantly 
along the neutrino path, as is the case for solar neutrinos produced near the solar core, one 
may encounter a level-crossing at the position for which the resonance condition is satisfied 
(for a given E), as illustrated in Figure 9.5. If the density varies sufficiently gradually, 
the transition is adiabatic, i.e., the neutrino remains on one level, with a maximal flavor 
transition probability. A more abrupt (non-adiabatic) transition will have a non-negligible 
probability to jump from one level to another, reducing the flavor transition probability (see, 
e.g., Kuo and Pantaleone, 1989; Strumia and Vissani, 2006; Gonzalez-Garcia and Maltoni, 
2008, 2013; Blennow and Smirnov, 2013). 


9.3.3 Oscillation Experiments 


There have been many experimental searches for and observations of neutrino oscillations 
and transitions (for recent reviews, see Gonzalez-Garcia and Maltoni, 2008; Diwan et al., 
2016; Patrignani, 2016), including experiments at accelerators and reactors, and those in- 
volving solar neutrinos and atmospheric neutrinos (from the decay products of particles 
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To T—> 


Figure 9.5 A level-crossing (resonance) at ro in the presence of matter density that 
decreases with distance r from the neutrino source. The m;(r) are the eigenvalues 
of the matrix in (9.90). (In the full three-neutrino case the state orthogonal to ve 
is actually a linear combination of v, and vr.) 


produced by cosmic ray interactions in the atmosphere). Major observation and exclusion 
regions are plotted in the two-neutrino formalism in Figure 9.6. 


Solar Neutrinos 


Neutrinos (ve) are produced by fusion reactions in main sequence stars by the pp and CNO 
chains, which ultimately lead to 4p > a+2et +2ve. The standard solar model (SSM) (Bah- 
call, 1989; Bahcall et al., 2006; Turck-Chieze and Couvidat, 2011; Haxton et al., 2013), which 
is well tested and constrained by helioseismology and other solar observations,?! and by the 
properties of other stars, is dominated by the pp chain and leads to the predicted solar ve 
spectrum in Figure 9.7. The most important reactions are 


p+p> D+et+n, D+p—> 3He+7, 2 3He > a+ 2p. (9.93) 


The first step leads to the most abundant (pp) neutrinos, which, however, have low energy 
and are hard to detect. About 15% of the time, however, 


3He +a > "Be+y7, "Bee > Li+ ve, (9.94) 


for one of the ?He, leading to the two intermediate energy discrete “Be lines in the ve 
spectrum. Approximately 0.02% of the chains involve the sequence 


"Bet+p— ®B+4, 8B > 8Be* +e + ve, (9.95) 


which leads to the 8B neutrinos. These are insignificant numerically, but because of their 
much higher energy are easiest to detect. The flux of pp neutrinos is well constrained by 
the observed solar luminosity, but the predicted “Be and (especially) 8B fluxes are much 
more uncertain because of their strong dependence on the temperature of the solar core, 
low energy nuclear cross sections, and the solar composition. 

The first solar neutrino experiment was the radiochemical °“Cl experiment, which used 


?1Recent observations of heavy element abundances yield lower values than earlier (less precise) determi- 
nations, creating some tension with helioseismology (e.g., Haxton et al., 2013). 
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Figure 9.6 Neutrino oscillation results, showing the solar/KamLAND and atmo- 
spheric neutrino oscillation regions, the LSND and MiniBooNE regions, and various 
exclusion regions. Plot courtesy of H. Murayama. 
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Figure 9.7 Spectrum of solar neutrinos predicted by the standard solar model. The 
gallium and liquid scintillator experiments are sensitive to the pp and higher energy 
neutrinos, the Homestake chlorine experiment to the higher energy ‘Be line and 
above, and the water Cherenkov experiments to the 8B neutrinos. The other (minor) 
reactions, i.e., the hep, pep, and CNO neutrinos, are described in (Bahcall, 1989). 
Plot reproduced by permission of the AAS from (Bahcall et al., 2005). 


a 10° gallon tank of cleaning fluid placed deep underground in the Homestake gold mine in 
South Dakota to shield from cosmic rays. Ray Davis and collaborators observed the decays 
of the °“Ar atoms produced in the reaction ve +°7Cl — e~ + "Ar and separated chemically 
from the tank. The original goal was to probe the solar interior, but by the early 1970s it 
was apparent that they were only observing ~ 1/3 of the expected flux, beginning a 30-year 
enterprise that ultimately explored both solar and neutrino physics. 

The Homestake results were confirmed in the late 1980s by the Kamiokande II water 
Cherenkov experiment in Japan, which also searched for proton decay and observed neu- 
trinos from Supernova 1987A and atmospheric neutrinos. The reaction ve~ — ve~ was 
mainly sensitive to v.’s, but because of WNC scattering had about 1/7 sensitivity to v,,,, 
which could be produced if the ve’s oscillated. They observed about 1/2 of the expected 
(without oscillations) solar flux, and also confirmed that the v’s actually came from the Sun 
because the e~ direction was correlated with that of the neutrino. These results were later 
confirmed and improved by the successor SuperKamiokande experiment (Abe et al., 2016), 
which extended the analysis to lower energies. 

The water Cherenkov experiments were only sensitive to the upper part of the 8B spec- 
trum and the much rarer hep neutrinos. The Homestake experiment had a lower threshold 
and was sensitive to more of the 8B spectrum and to some extent the “Be and pep neu- 
trinos. The reduced fluxes that they observed could have been due to uncertainties in the 
standard solar model (e.g., by a 5% reduction in the temperature of the core) or other 
astrophysical effects, or to neutrino oscillations/transitions into v,,, Vr, or sterile neutrinos. 
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To distinguish these possibilities, radiochemical experiments on gallium, using the reaction 
vet Ga— e~ + "Ge, were carried out in the 1990s. This has a much lower threshold (233 
keV) than the chlorine reaction (814 keV), allowing detection of the much more numerous 
pp neutrinos. An observation comparable to the SSM prediction would have suggested that 
the Homestake and (Super)Kamiokande deficits were due to astrophysical effects, while a 
reduction comparable to that for 8B would have indicated neutrino oscillations. Eventu- 
ally three gallium experiments, GALLEX and GNO in the Gran Sasso Laboratory in Italy 
and SAGE in the Baksan Laboratory in Russia, were carried out, indicating > 50% of the 
predicted SSM flux. 

No one type of these experiments by itself could definitively exclude an astrophysical 
explanation, especially allowing for large modifications of the SSM, but the three types 
together constituted a rough measurement of the distortion of the neutrino energy spectrum. 
Including the solar luminosity constraint and assuming that plausible astrophysical effects 
would not significantly modify the shape of the 8B spectrum, it was found that the ‘Be 
neutrinos would have had to be suppressed much more than the 8B ones (Hata et al., 
1994). Because the 8B is made from "Be, this effectively excluded astrophysical explanations, 
but allowed neutrino oscillations or transitions (which could modify the shape of the °B 
spectrum). 

The case was clinched a decade later by the SNO (Sudbury Neutrino Observatory) heavy 
water experiment in Ontario (Aharmim et al., 2013; Bellerive et al., 2016). SNO observed 
Cherenkov radiation from electrons and photons from neutron capture, allowing then to 
measure both WCC and WNC scattering from deuterium by 


Ve +D >e +p+p, Vr + D > vr +p+n, (9.96) 


where Vy is any active neutrino. They also measured the electron scattering reaction ve™ > 
ve, consistent with but less precisely than SuperKamiokande. The combination of these 
measurements allowed the SNO collaboration to separately determine the fluxes of ve and 
of V, + v, arriving at the Earth, as shown in Figure 9.8. The result was that the sum of 
the three was consistent with the SSM flux prediction, and that about 2/3 had oscillated or 
been converted into v,,,, (assuming that there are no sterile neutrinos involved), confirming 
both the SSM and neutrino oscillations. 

More recently, the Borexino experiment (Bellini et al., 2014a,b) has studied solar neutri- 
nos by observing ve~ — ve” in a liquid scintillator detector in the Gran Sasso Laboratory. 
Borexino has measured the rates for the upper Be neutrino line, pp neutrinos, and 8B neu- 
trinos at much lower energy than the water Cherenkov experiments. They also measured 
the rare pep neutrinos and set limits on neutrinos from the CNO cycle. 

The survival probabilities P,,_,,, as measured by Borexino and the other experiments 
are shown as a function of neutrino energy in Figure 9.9. 

At various stages the solar neutrino experiments allowed a number of solutions for neu- 
trino oscillation parameters. These included the Am? ~ 107!° eV? vacuum oscillation 
solutions, for which the Earth-Sun distance was of O(Losc). The matter effects were unim- 
portant for these solutions and oscillations during the propagation between the Sun and 
Earth dominated. There were also several solutions with higher Am?, for which matter 
effects were significant. Eventually, the combination of the observed rates from the different 
reactions, as well as limits on the distortion of the 8B spectrum and constraints (Bellini 
et al., 2014a) on or observation (Abe et al., 2016) of energy-dependent day-night asymme- 
tries (due to reconversion to Ve in the Earth), established the large mixing angle (LMA) 
solution characterized by (Bellerive et al., 2016) 


Ams =5.1573x 10° eV", = tan? = 0.427 + 0.028, (9.97) 
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Figure 9.8 Fluxes from electron scattering (ES) and from the WCC and WNC reac- 
tions in the SNO heavy water experiment, compared with the standard solar model 
expectation. Plot reproduced with permission from (Bellerive et al., 2016). 
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Figure 9.9 Survival probabilities P,,,,, as a function of energy, compared with the 
predictions of the large mixing angle (LMA) solution. From (Bellini et al., 2014a). 
Not included is the recent direct measurement of the pp flux from the Phase 2 of 
Borexino (Bellini et al., 2014b). 
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corresponding”? to sin? 0o = 0.299 + 0.014. A more detailed analysis involving all three 
neutrinos yields similar results. This is because 013 in (9.35) turns out to be small, justify- 
ing the two-neutrino formalism as a reasonable first approximation, with the identification 
Am?, ~ Am}, and ĝo ~ 612. 

The solar angle ĝo is large but not maximal, i.e., 9g 4 7/4, so the Ve is predominantly 
vı, while v2 consists mainly of a linear combination of v„ and v,. The characteristics of the 
LMA solution are that a higher energy ve encounters an adiabatic MSW resonance, emerging 
from the Sun in an essentially pure 12 mass eigenstate. This does not oscillate, and has a 
probability sin? 6o ~ 0.30 of interacting as a ve. The density is too low for a resonance for 
the lower energy neutrinos, so they emerge from the Sun as a ve, arriving at the Earth with 
an average survival probability ~ 1— z sin? 205 ~ 0.58. The sign of Am?, > 0 is determined 
from the matter effect. Subsequently, the LMA solution was dramatically confirmed by the 
KamLAND reactor experiment in Japan (see below), which yielded a more precise (but 
slightly higher) Am?,. 

A number of other particle physics interpretations for the solar neutrino deficits and 
other observations have been advanced (e.g., Maltoni and Smirnov, 2016), including oscil- 
lations into sterile neutrinos (Cirelli et al., 2005); resonant spin-flavor transitions involv- 
ing a large magnetic moment (Akhmedov, 1988; Lim and Marciano, 1988; Pulido, 1992; 
Giunti and Studenikin, 2015); neutrino decay (Beacom and Bell, 2002); new flavor chang- 
ing or conserving interactions (Friedland et al., 2004; Miranda et al., 2006; Ohlsson, 2013); 
mass varying neutrinos due to interactions with the environment (Kaplan et al., 2004); 
Lorentz, CPT, or equivalence principle violations (Glashow et al., 1997; Diaz and Kost- 
elecky, 2012); or CPT-violating decoherence between the quantum components of the wave 
function (Barenboim and Mavromatos, 2005). All of these are now excluded as the domi- 
nant effect for solar neutrinos, although they could still exist as perturbations on the basic 
picture. 

Existing and future solar neutrino experiments are reviewed in (Antonelli et al., 2013). 


Atmospheric Neutrinos 


Although the first indications of neutrino oscillations involved the solar neutrinos, the first 
unambiguous evidence came from the oscillations of atmospheric neutrinos (e.g., Kajita, 
2014). Atmospheric neutrinos result from pion and muon decays, which are produced in 
the upper layers of the atmosphere due to the interaction of primary cosmic rays. The 
data from the Kamiokande and SuperKamiokande water Cherenkov detectors indicated the 
disappearance of v, and D, (which will not be further distinguished in this paragraph). 
This was first seen in the ratio of the v,,/v. fluxes, and later confirmed dramatically by 
the zenith angle distribution of v„ events, with SuperKamiokande officially announcing 
their results in 1998 (Fukuda et al., 1998). Other experiments such as IMB, MACRO, and 
Soudan confirmed the results. The details of the SuperKamiokande ve and vy, events, as 
well as evidence for v, appearance and constraints from other experiments show that the 
dominant effect is the oscillations of v, into v;, and not Ve or a sterile vg (Kajita et al., 
2016). SuperKamiokande obtained (Abe et al., 2011) the 90% c.l. ranges 

|Amem| ~ (1.7 — 3.0) x 1073 eV?, sin? 20atm ~ (0.93 — 1.0), (9.98) 


a 


from their 1998-2008 data, corresponding to sin? 041m ~ (0.37 — 0.63) . 
To a good approximation, Am?, can be neglected for the atmospheric neutrinos, justify- 


ing the two-neutrino formalism with Am?2,,,, ~ Am3,. From (9.80) and (9.35) the effective 


?2Various authors quote the neutrino mixings as tan? 0, sin? 6, sin? 20, or even as 0 in degrees. 
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angle for v„ survival is then 
sin? Gin = Vig = Bhatia: (9.99) 


so that Qaim ~ 923 for small 61:3. The data is consistent with maximal mixing (023 = 7/4), 
and in fact the best fit is for that value. There are no Earth matter effects for v, —> vr, so 
the sign of Am3, is not determined in the approximation of neglecting both Am?, and 613. 

The atmospheric neutrino fluxes were subsequently measured in the MINOS de- 
tector (Adamson et al., 2012), the steel-scintillator far detector for the NuMI-MINOS 
long-baseline facility described below. MINOS obtained results consistent with Su- 
perKamiokande. The long-baseline experiments further confirm the atmospheric neutrino 
results, though there is some suggestion of non-maximal mixing. 

Both MINOS and SuperKamiokande also showed that the v, and v, disappearance 
parameters are the same within uncertainties, consistent with CPT. This was done by 
MINOS on an event by event basis by exploiting the magnetization of the detector, while 
SuperKamiokande utilized the differences in the distortions of the zenith angle distributions. 

Additional or future observations of atmospheric neutrinos include the high energy neu- 
trino telescopes described below and the ICAL (Ahmed et al., 2015) detector at the Indian 
Neutrino Observatory (INO). 

Comments similar to those for solar neutrinos concerning alternatives to neutrino os- 
cillations apply to the atmospheric results. It is interesting that the atmospheric neutrino 
oscillations are a quantum mechanical coherence effect on the size scale of the Earth. 


Accelerator Neutrinos 


Early short-baseline accelerator experiments included v, (or D) disappearance experiments 
and searches for Ve or v, appearance at CERN, Brookhaven, and Fermilab (e.g., Gonzalez- 
Garcia and Maltoni, 2008). These typically involved energies in the 1 — 100 GeV range 
and distances 100 — 1000 m, with sensitivities to Am? > 10~' — 1 eV? for large mixing. 
No evidence for oscillations was found (but see the discussion of possible sterile neutrinos 
below). 

More recently, there have been long-baseline experiments involving beams from KEK 
(K2K) and J-PARC (T2K, for Tokai to Kamioka) (Abe et al., 2015a) to SuperKamiokande; 
the Fermilab NuMI beam to MINOS (Timmons, 2016) in the Soudan mine in Minnesota 
and (recently) to NOvA (Adamson et al., 2016a) in Ash River, Minnesota; and the CERN 
CNGS beam to the OPERA and ICARUS detectors in the Gran Sasso Laboratory (for 
reviews, see Feldman et al., 2013; Diwan et al., 2016; Nakaya and Plunkett, 2016). These 
experiments monitor the initial neutrino fluxes by their interactions in a near detector 
close to the accelerator (or by detailed simulations of the CNGS beam). The K2K and 
T2K experiments have baselines L < 300 km, those using the Fermilab-MINOS and CERN 
beams have L ~ 735 km, and Fermilab-NOvA has L ~ 810 km. These are sensitive to 
much lower Am? than the traditional short-baseline accelerator experiments, down into the 
atmospheric neutrino region ~ 107? eV?. The long-baseline experiments have confirmed 
the SuperKamiokande and MINOS atmospheric oscillation results. They have significantly 
reduced the uncertainty in |Am3,| and sin? 023, as can be seen in Figure 9.10. For example, 
T2K obtained (Abe et al., 2015a) 


|Am3,| = 2.51(10) x 1073 eV, sin? 023 = 0.51(6) (9.100) 


at lo from their v, disappearance data, assuming the normal hierarchy, with similar results 
for the inverted. The MINOS and NOvA data also suggest that 023 may be non-maximal. 
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The results are consistent with dominantly v, — v, oscillations,?* and that the v, and 
D, survival probabilities are the same. They have also observed sub-dominant oscillations 
into ve, which are associated with the (small) angle 613, and obtained strong constraints on 
possible sterile neutrinos. 


Normal Hierarchy, 90% CL 
—  NOvA 

T2K 2014 
--- MINOS 2014 


0.3 0.4 0.5 0.6 0.7 
+ ae 
sin 8,4 


Figure 9.10 Allowed regions at 90% c.l. for the atmospheric neutrino oscillation pa- 
rameters from atmospheric and long-baseline data, from (Adamson et al., 2016a). 


The T2K and NOVA experiments both involve off-axis beams, taking advantage of the 
narrower energy spectrum away from the center, and both will benefit from planned up- 
grades to the intensity of the neutrino sources. These and other advantages should allow 
them to simultaneously search for leptonic CP violation (e.g., Nunokawa et al., 2008) and 
determine the type of neutrino hierarchy via matter effects in long-baseline ve and De ap- 
pearance experiments. Both the appearance and disappearance channels will be sensitive to 
023. There are also plans for two very large next generation experiments. One is the Deep 
Underground Neutrino Experiment (DUNE) (Acciarri et al., 2015), featuring a 40 kiloton 
liquid argon detector in the Sanford Laborary in Homestake, 1300 km from Fermilab. The 
other is Hyper-Kamiokande (Abe et al., 2015b), a megaton water Cherenkov detector near 
SuperKamiokande, utilizing a beam from J-PARC. These detectors will also be sensitive to 
proton decay, and to supernova, solar, and atmospheric neutrinos. 


Reactor Neutrinos 


There were a number of early reactor disappearance experiments, which compared the flux 
of De at short distances, L ~ (10 — 100) m, with the theoretical expectations based on the 
known reactor energy output (see, e.g., Gonzalez-Garcia and Maltoni, 2008). These were 
sensitive down to |Am?| = 107? eV? for large mixing. They did not report any evidence for 
oscillations at the time, although a recent reanalysis of the expected fluxes (Mention et al., 
2011) has suggested the possibilitiy of disappearance into sterile neutrinos, as described 
below. 


?3The OPERA hybrid emulsion experiment has directly observed five v, appearance events (Agafonova 
et al., 2015). 
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The subsequent (around 2000) Palo Verde and Chooz experiments had longer baselines 
of O(1 km), and were able to exclude significant De mixing for |Am?| > 1073 eV? (e.g., Qian 
and Wang, 2014). Since the atmospheric neutrino results already established |Am3,| > few 
x10738 eV?, this implied that sin? 20,3 < 0.12 (tan? 013 < 0.032) at 90% c.l., which is small 
compared to the other leptonic mixing angles but comparable to quark mixings. The value 
of 013 is critical: the small value motivated models of neutrino mass in which 013 vanishes 
or is tiny (Section 9.5). Furthermore, the possibility of observing leptonic CP violation 
and significant matter effects in terrestrial experiments depends on having a sufficiently 
large 013. 

For these reasons, an intensive effort was undertaken to observe or constrain 013 for 
|Am?| in the atmospheric neutrino range (Qian and Wang, 2014). Hints of a nonzero value 
were obtained from a global analysis of existing data (Fogli et al., 2008), but the first direct 
experimental evidence was obtained around 2011 by the MINOS and T2K long-baseline 
accelerator experiments, which observed v, — Ve appearance at the several ø level. Sub- 
sequently, the Double Chooz”* reactor experiment reported evidence for De disappearance. 
Finally, the Daya Bay?” reactor experiment in China observed Pe disappearance at greater 
than 50, establishing that 013 4 0. RENO in South Korea also observed disappearance at 
nearly 50. Currently, The Daya Bay value (An et al., 2017) is 


sin? 2013 = 0.084(3) = tan? 6,3 = 0.022(1), (9.101) 


which is 250 away from zero, and also close to the earlier upper limits. They also obtain 
|Am2,| = 2.45(8)[2.56(8)] x 1073 eV? for the normal [inverted] hierarchies, comparable 
to the long-baseline results. The RENO results (Choi et al., 2016) are consistent but less 
precise. Proposed future reactor experiments (e.g., Diwan et al., 2016) include JUNO in 
China and RENO-50 in Korea, each with baselines around 50 km. They would be able to 
distinguish between the normal and inverted hierarchies, independent of CP violation and 
matter complications, by exploiting the difference between |Am3,| and |Am3,|, as well as 
constrain the oscillation parameters. 

Because of the characteristic Am?L/E dependence of neutrino oscillations, long-baseline 
(L ~ hundreds of km) experiments can probe to much lower Am? than the traditional short- 
baseline ones. The KamLAND experiment was a liquid scintillator detector at the location of 
the original Kamiokande detector (Gando et al., 2013). It observed a De flux from a number 
of Japanese reactors, at a typical distance L ~ 200 km, which allows one to probe down 
into the region of the LMA solar neutrino solution. The KamLAND results dramatically 
confirmed the LMA interpretation of the solar neutrino deficit and gave a much more precise 
value for Am, around 7.5(2) x 1075 eV?, as can be seen in Figure 9.11. KamLAND was 
also able to directly observe the L/E dependence expected from neutrino oscillations, and 
to observe 8B and “Be solar neutrinos. 


Geoneutrinos 


Low-energy antineutrinos from radioactive decays in the Earth were initially an important 
background for the liquid scintillator experiments. Both KamLAND (Gando et al., 2011) 
and Borexino (Agostini et al., 2015b) were eventually able to measure the flux of these 


24Double Chooz is a follow up to the Chooz experiment with near and far detectors. The initial result on 
De disappearance was obtained with only the far detector. 

25The Daya Bay experiment consists of a number of near (L ~ 0.5 km) and far (L ~ 1.5 km) detectors 
relative to a reactor complex. Comparison of the fluxes in the near and far detectors essentially eliminates 
the uncertainties from the initial reactor flux. RENO is configured analogously. 
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Figure 9.11 Left: oscillation parameters Am3, (eV?) vs tan? 6,2 determined from 
solar neutrino data at 68, 95, and 99.7% c.l. (egg-shaped contours), compared with 
the KamLAND results (horizontal contours), and the combined fit. From (Bellerive 
et al., 2016). Right: L/E dependence of the De survival probability as determined 
by KamLAND. From (Gando et al., 2013). 


geoneutrinos from the decay chains of ?3?Th and 738U (but not those from 4°’), constrain- 
ing models of the element abundances and interior heating of the Earth. For a review, 
see (Ludhova and Zavatarelli, 2013). 


High Energy Neutrinos 


Ultra-high energy neutrinos, e.g., produced by pion decay, are potentially an extremely 
useful probe of violent astrophysical events. They are not significantly absorbed in inter- 
stellar /intergalactic media, and (unlike cosmic ray protons or nuclei) they are not deflected 
in magnetic fields and therefore point back to their sources. Neutrino telescopes are large de- 
tectors with good ability to measure the direction of the incoming neutrino. These typically 
involve strings of photomultiplier tubes deployed deep underwater (such as the ANTARES 
array in the Mediterranean) or under ice (such as the km? IceCube array at the South 
Pole). The expected cross sections for ultra-high energy neutrinos are reviewed in (Gandhi 
et al., 1998; Formaggio and Zeller, 2012), and the experiments in (Katz and Spiering, 2012; 
Gaisser and Halzen, 2014). 

The IceCube collaboration has observed (Aartsen et al., 2013) several events with ener- 
gies above 1 PeV (10° GeV), the highest energy neutrinos ever observed, as well as many 
additional events with energies above 30 TeV. These are most likely extra-terrestrial, and 
could be due to galactic or extra-galactic astrophysical events, or to exotic particle physics 
such as super-heavy dark matter decays (see Anchordoqui et al., 2014). IceCube has also 
shown (Aartsen et al., 2015) that the flavor ratio of high energy neutrinos arriving at the 
Earth is consistent with the value v./v,,/v; = 1/1/1 expected for stable neutrinos produced 
by pion decay in distant sources when oscillations are taken into account. (Other sources, 
such as unstable neutrino decays, could lead to other ratios (e.g., Bustamante et al., 2015).) 
IceCube has also observed atmospheric oscillations at much lower energies (but still higher 
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energy than the other atmospheric experiments), constrained oscillations into sterile states, 
and searched for neutrinos from dark matter annihilation in the Sun. It may also be sensitive 
to the Glashow resonance (Glashow, 1960), in which the cross section for De scattering is 
greatly enhanced at 6.3 PeV due to the resonant scattering Yee" — W~ — X from electrons 
in the atmosphere; to nonstandard interactions; and to neutrinos from galactic supernova. 
The proposed PINGU upgrade to the inner detector (Aartsen et al., 2017) would lower 
the energy threshold to around 5 GeV, allowing the determination of the mass hierarchy 
through resonant matter effects, and possibly allowing Earth tomography. 

The proposed KM3NeT underwater telescope in the Mediterranean (Adrian-Martinez 
et al., 2016) and the Lake Baikal-GVD facility in Russia would have similar capabilities. 


9.3.4 Possible Sterile Neutrinos 


There are several indications of mixing between active and light (eV scale) sterile neutrinos, 
as well as many null experiments. For reviews, see, for example, (Abazajian et al., 2012; 
Palazzo, 2013; Kopp et al., 2013; Gariazzo et al., 2016). 

The LSND experiment at Los Alamos observed a 3.80 excess of Pep — etn events in a D, 
beam obtained from jut decay at rest (Aguilar-Arevalo et al., 2001). This suggested P, > De 
oscillations” with L ~ 30 m and L/E ~ (0.4— 1.5) m/MeV, corresponding to |Am? 5x p| ~ 
(0.2 — 10) eV? and small mixing. This was not confirmed by the KARMEN2 experiment 
at Rutherford, but there was a small parameter region allowed by both. The LSND result, 
along with the solar and atmospheric oscillations, would imply three or more distinct Am?’s, 
and therefore at least four light neutrinos that mix with each other. The extra neutrinos 
would have to be sterile because the invisible Z width result Ni”” = 2.992 + 0.007 does not 
allow a fourth light active neutrino. 

Subsequently, the Fermilab MiniBooNE experiment searched for ve appearance in a Vy 
beam with L ~ 541 m and E, ~ (20— 1250) MeV, and later for Ve appearance in a P, beam. 
The final MiniBooNE results?” (Aguilar-Arevalo et al., 2013) showed excesses in both ve and 
De -like events. The De excess was consistent with the LSND oscillation signal, though with 
a lower statistical significance. The energy dependence of the ve excess was only marginally 
consistent with the other results, however, at least within the framework of a single sterile 
neutrino. 

|Am? syp] and the associated E/L are large compared to |Am?2,,,,| and Am3. Assuming 
the existence of a single sterile neutrino vg, one can therefore treat the three largely-active 
states as degenerate and use the effective two-neutrino formalism in Equations (9.76)— 
(9.80), with |A] = |m} - m? 5 3| = |AmZ sn p| ~ (0.2— 10) eV’, where m4 is the mass of the 
additional (largely sterile) mass eigenstate. The effective mixing for Dy, > De and Vy —> Ve 
appearance is therefore 


1 
sin? 29e = 4 sin? Opp sin? Oee ~ i sin? 20u Sin? 2bee, (9.102) 


26Other possibilities to account for LSND, e.g., involving CPT violation, decoherence, new interactions, 
extra dimensions, mass-varying neutrinos, and hybrids, are surveyed in (Gonzalez-Garcia and Maltoni, 
2008). 

27The initial MiniBooNE Vu — Ve analysis was restricted to Ey > 475 MeV so as to coincide with the 
LSND L/E range. No excess was observed in this range, although there was an anomalous excess of Ve 
events at lower energies. The excess observed in the subsequent P, runs stimulated considerable discussion 
of possible CP violation to account for the v — D difference. However, the final MiniBooNE ve analysis 
included events with H, down to 200 MeV. The resulting excess was mainly due to (but did not fully 
account for) the low energy anomaly. The LSND and MiniBooNE experiments are reviewed in (Conrad 
et al., 2013b), while the experimental and theoretical status of low-energy v scattering is reviewed generally 
in (Garvey et al., 2015). 
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where sin? Gin = yal? sin? fee = |Ve4|2, and the last expression assumes small mixing. 
But sin? Oupa and sin? ce, respectively, control the vy, and ve survival probabilities; e.g., 
the mixing for ve disappearance into vg is sin? 20.5 = 4|V*,Vs4|? ~ 4sin? bee. Thus, sterile- 
induced oscillations v, —> ve necessarily imply disappearance of both v, and ve into vg, and 
also the absence of any C P-violating difference between v and 7, as long as the effective two- 
neutrino approximation is valid. For additional sterile neutrinos one can still usually treat 
the (mainly) active states as degenerate. The LSND-MiniBooNE signal still typically leads 
tov, and ve disappearance, although the additional parameters would allow for cancellations 
and for observable differences between v and P. 

Short-baseline (L < 100 m) reactor experiments should be sensitive to De disappearance 
into a sterile neutrino for Am? in the LSND range. However, in existing experiments the 
measured fluxes must be compared with theoretical Pe spectra obtained from detailed mod- 
eling of the relevant fission decay chains in the reactors. A recent reanalysis (Mention et al., 
2011) obtained a predicted flux about 3% higher than earlier estimates (see also Huber, 
2011), leading to the reactor anomaly: whereas the previous measurements were consistent 
with no De disappearance (P;,-45, = 0.976 0.024), the new theoretical spectrum suggested 
a 2.50 deficit (0.943 + 0.023), which could be due to oscillations into sterile neutrinos.?° 
Some caution is required, however, since a later study (Hayes et al., 2014) argued that the 
systematic uncertainties in the flux are as large as the anomaly. The situation is further 
complicated by the observation by Daya Bay, RENO, and Double Chooz of a statistically 
significant enhancement (“bump”) in the Pe flux in the 4-6 MeV region, which is not ex- 
plained in the flux models. The entire situation is reviewed in (Hayes and Vogel, 2016). 

Additional evidence for sterile neutrino oscillations comes from the gallium anomaly. The 
SAGE and GALLEX solar neutrino experiments each used intense radioactive Ve sources 
(510r, 37 Ar) of known intensity as a cross check on their efficiencies and on the theoretical 
estimate of the cross section for ve +7!Ga > e` + ™Ge. The observed rates were around 
2.80 below the expectation (Gariazzo et al., 2016), suggesting P,..,, = 0.84+0.05 (or that 
the other uncertainties are underestimated). 

There have also many negative searches for oscillations involving sterile neutrinos. These 
include searches for ve or De disappearance in accelerator and longer-distance reactor ex- 
periments, and global analyses of the solar neutrinos. Similarly, there are many null results 
on V, and P, disappearance?’ from short- and long-baseline accelerator experiments and 
atmospheric neutrinos, as well as searches for V, —> Ve or Vy (Ve) appearance beyond the 
three flavor expectations by OPERA (ICARUS) in the CNGS beam. Especially strong lim- 
its on v, disappearance by MINOS and Pe disappearance by Daya Bay constrain sin? Oi 
and sin? Oee, respectively, and these can be combined in a joint analysis (Adamson et al., 
2016b) to exclude a significant region in the sin? 26,,. — Am2; plane. IceCube (Aartsen et al., 


2016) has also exploited the nonobservation of resonant matter effects on atmospheric 7 
to strongly constrain sin? Oun- These and earlier results are in considerable tension with the 
LSND-MiniBooNE data in the case of one sterile neutrino, and exclude most ot the low 
Ami, (high sin? 20e) region, as shown in Figure 9.12. A region around Am3, ~ 1.6 eV? 
and sin? 20e ~ 0.0015 appears to be the best compromise (e.g., Kopp et al., 2013; Gariazzo 
et al., 2016; Collin et al., 2016). Somewhat better agreement can be obtained if there are 
two or three sterile neutrinos (Conrad et al., 2013a), though this conclusion is weakened if 
one does not include the anomalous MiniBooNE low-energy data (Gariazzo et al., 2016). 
As discussed in the sections on neutrino counting (page 385) and neutrino mass 


28The value of 613 is insensive to the sterile mixing. 
2b, oscillations into vy can be distinguished from vg statistically by neutral current scattering, matter 
effects, and 7 production. 
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Figure 9.12 Left: 90% c.l. exclusion regions for sterile neutrino oscillations from 
the combined MINOS-Daya Bay analysis (to the right of the outer contour), con- 
trasted with the allowed LSND and MiniBooNE regions (narrow, diagonal regions), 


from (Adamson et al., 2016b). Right: 30 allowed region from short-baseline 7 
appearance experiments other than the anomalous low-energy MiniBooNE results 
(egg-shaped contour), and the 30 exclusion regions from disappearance experiments 


(right of the leftmost line). The limits from Vi, (nearly vertical) and W. are also 
shown. The overall best fit is the cross around sin? 26,,¢ ~ 0.0015, Am3} ~ 1.6 eV?. 
From (Gariazzo et al., 2016. Copyright IOP Publishing. Reproduced with permis- 
sion. All rights reserved.) 


(page 387) there are cosmological implications for both the numbers and masses of sterile 
neutrinos. In particular, in the standard cosmological scenario the sterile neutrinos should 
have thermalized efficiently by active-sterile mixing for the parameters suggested by Fig- 
ure 9.12, i.e., Ts ~ T, so that NOM? ~ 4 and met ~ m4 2 1 eV, in tension with the 
Planck limits in (9.59). The situation would be worse if there are additional eV-scale ster- 
ile neutrinos with significant mixing. These constraints could be relaxed or evaded if the 
production is somehow suppressed, e.g., by large lepton asymmetries, late-time phase tran- 
sitions, a low reheating temperature after inflation, or time-varying masses (for reviews, see, 
e.g., Langacker, 2005; Gariazzo et al., 2016). The possibility that one or more sterile states 
are lighter than the active ones is mentioned in the next section. 

The situation concerning possible light sterile neutrinos and their mixing with active ones 
is confusing, and no compelling picture has emerged. Definitive experiments, e.g., involving 
intense radioactive sources, experiments very close to reactors, or specially-configured ac- 
celerator experiments, are needed to resolve the situation (for a review, see Gariazzo et al., 
2016). It is also challenging to construct a theory in which active and sterile neutrinos mix, 
as will be discussed in Section 9.5. 
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9.4 THE SPECTRUM 


Three Active Neutrinos 


Most data other than the LSND results can be accommodated by three active neutrinos. The 
solar and atmospheric mass-squared differences in (9.97) and (9.98) allow several possible 
patterns for the spectrum. Assuming that the absolute masses are comparable to the mass 
splittings, the sign ambiguity in Am?, allows either the normal hierarchy (NH) or the 
inverted hierarchy (IH), as illustrated in Figure 9.13. The normal hierarchy is most similar 
to the quark spectrum, but the analogy is poor since the CKM mixing angles are all small. 
In both cases, the data is compatible with mo = 0, where mo is the lowest mass, i.e., 
Mo =m, (mg) for the normal (inverted) hierarchy. 

It is also possible that the absolute masses are larger than the mass differences (the 
degenerate spectrum), perhaps as large as a few tenths of an eV each if one stretches the 
cosmological limits on © = J`; |m,| in (9.58). Either sign for Am, is still allowed, and both 
the NH and the IH smoothly merge into the degenerate spectrum as mo increases. 
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Figure 9.13 Left: the normal hierarchy for three neutrinos. Right: the inverted hier- 
archy. The vertical-dashed, open, and horizontal-ruled regions indicate the central 
values of |Vei|?, |Vuil?, and |V;i|?, respectively, using the best fit parameters. The 
degenerate case corresponds to adding a large common mass to each state. In this 
context v3 is usually defined as the isolated state rather than heaviest. 


A recent global three-neutrino analysis (Esteban et al., 2017) obtained 
Am}, ~ 7.50(18) x 107° eV?°, Am3, ~ 2.52(4) x 1073 eV”, 8 = 261(55)° 
sin? 019 ~ 0.31(1), sin? 023 ~ 0.44(2), sin? 013 = 0.022(1), (9.103) 


assuming the normal hierarchy (Am?, = Am3, > 0). The results are similar for the inverted 
hierarchy except Am2, = —2.51(4) x 1078 eV? < Am3,, sin? 023 ~ 0.59(2), and ô = 
277(43)°. The x? distributions for the parameters are shown in Figure 9.14. The analyses 
in (Forero et al., 2014; Capozzi et al., 2016) obtain similar results. 

From these results it is apparent that 


There is a slight (Ay? ~ 0.8) preference for the normal hierarchy. 
613 is nonzero but small. 


The solar angle 612 is large but non-maximal. 
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Figure 9.14 Ay? distributions (with repect to the best fit), from (Esteban et al., 
2017), for the normal (lower Ay?) and inverted hierarchies. 


There is a preference for non-maximal 0993, driven by NOvA and MINOS, but not 
T2K, as can be seen in Figure 9.10. However, maximal mixing (023 = 7/4) is still 
allowed at 2c. 


The first (second) octant for 023 is preferred for the normal (inverted) hierarchy. 


There is a preference for the Dirac CP phase 6 to be nonzero and to lie between 1.17 
and 1.87. However, ô = 7 and 27 (i.e., no CP violation) are allowed at 2c, and the 
entire range 0 — 27 at 30. 


For mı = 0 in the normal hierarchy one finds mə ~ 0.009 eV and m3 ~ 0.050 eV, with 
m2/m3 ~ 0.17, which is quite large compared to the analogous quark and charged lepton 
mass ratios. For the inverted hierarchy, m3 = 0 implies mı ~ 0.049 eV and mz ~ 0.050 eV. 

The predicted values for © are shown as functions of the lightest mass mo and compared 
with existing and projected experimental constraints in Figure 9.15. The existing cosmo- 
logical limits on (which, however, have a large theoretical uncertainty) are somewhat 
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weaker than the expected values for three neutrinos except for a degenerate spectrum with 
mo Z 0.05 eV. Future observations may be sensitive to the entire mass range. 
For three neutrinos, the effective Go, mass is 


mgg = [E Vem, 


~ ai my, +e”? 52, m + e752, mal, (9.104) 


where c13 ~ 1 in the last expression. The predicted range is shown and compared with 
experiment in Figure 9.15. For the normal hierarchy the predicted range is very small, 
mgg < few x107? eV, because of the small s73, and it could even vanish due to cancellations 
for nonzero mı. The range is below the sensitivity of the next generation of experiments. 
mgg is much larger for the inverted hierarchy, however. Up to small corrections from the last 


2i(a2—a1) 2 


term, it is given by ee m,+e s?4 m2|, which can vary from ~ (c?, — s?,) |Am3,|1/? 


to |Am3,|!/?, i.e., (0.02 — 0.05) eV, depending on the relative Majorana phases (a2 — a4). 
This should be within the reach of existing and planned po, experiments, at least in their 
later phases. Both hierarchies smoothly merge in the degenerate region, where mgg ranges 
approximately from (c? — s72) mo ~ 0.4m to mo. 

Existing and projected 8 decay experiments are only sensitive to three-neutrino masses 
in the degenerate region, as can be seen in Figure 9.15. There, the effective 8 decay mass 
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Figure 9.15 Predictions for the total mass © (left) and the effective 660, mass mgg 
(right) for three neutrinos, as a function of the lightest mass mo, for the normal 
hierarchy (NH) and the inverted hierarchy (IH). The angles and masses are allowed 
to vary over their lo ranges, while the phases a ,2 and 6 vary from 0 to 27. The light 
gray band in the © plot is an approximate spread of upper limits from cosmology, 
corresponding to different data sets and priors. The lower dashed line is an estimate 
of future sensitivity. The vertical line is the upper limit on my, ~ mo anticipated 
from tritium 8 decay. The upper (lower) gray horizontal bands on the right repre- 
sent the existing experimental upper limit (anticipated future sensitivity), with the 
widths due to uncertainties in the nuclear matrix elements. 
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Additional Sterile Neutrinos 


The LSND, MiniBooNE, and reactor and gallium anomaly results, if interpreted in the 
context of neutrino oscillations, require the existence of one or more light sterile neutrinos 
that mix significantly with active neutrinos of the same helicity. One additional sterile 
neutrino could be accomodated in either the 2 + 2 or the 3+ 1 patterns, illustrated in 
Figure 9.16. The 2+ 2 case would require that the solar and/or atmospheric results involve 
a significant admixture of oscillations into the sterile neutrino. However, it is well established 
that neither the solar nor the atmospheric oscillations are predominantly into sterile states, 
and this scheme is excluded. In the 3+ 1 schemes a predominantly sterile state is separated 
from the others by ~ |[Am2 sy p|!/?. However, as mentioned in the Section 9.3.4 any scheme 
with just one relevant sterile neutrino is in strong tension with other experiments. One can 
generalize to 3 + 2, 3 +3, or 3 +n schemes with additional sterile states heavier than the 
three active ones, possibly obtaining better agreement with the other experiments, but at 
the expense of increased tension with the cosmological limits. One can also consider schemes 
in which the sterile neutrinos are lighter than the active ones, or in which some are lighter 
and some heavier. However, these are even more difficult to reconcile with the cosmological 
limit on E, since each of the three active states must be heavier than |Am7,¢y p|!/?. 

Sterile neutrinos can give significant contributions to the effective masses for 6 de- 
cay (Esmaili and Peres, 2012) and Go, in (9.56) and (9.60), especially if there are more 
than one (e.g., Barry et al., 2011). However, mgg can also vanish due to cancellations (i.e., 
if the (e,e) element of the Majorana mass matrix vanishes). 
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Figure 9.16 Left: an example of a 2+ 2 pattern. Right: a 3+ 1 pattern. The dashed 
(solid) lines indicate qualitatively the fractions of sterile (active) neutrinos prior 
to small mixings. Other patterns correspond to the inverted hierarchy for the at- 
mospheric neutrinos or to placing the largely sterile state in the 3+ 1 case on the 
bottom. 


9.5 MODELS OF NEUTRINO MASS 


There are an enormous number of models of neutrino mass and mixing, as reviewed in (Mo- 
hapatra and Smirnov, 2006; Mohapatra et al., 2007; Strumia and Vissani, 2006; Albright, 
2009; Ma, 2009; Altarelli and Feruglio, 2004, 2010; Langacker, 2005, 2012; Barger et al., 
2012; Antusch, 2013; King, 2015; de Gouvéa, 2016). Here we mention the major issues and 
possibilities. 
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9.5.1 General Considerations 


There are a number of general issues concerning neutrino mass, including: 


— Are the masses Dirac or Majorana? 


The only distinction between Dirac or Majorana neutrinos in the standard model is 
the type of mass term and associated Yukawa interactions.2° That is, both active and 
sterile neutrinos are described by Weyl two-component spinors, and in the massless 
limit any sterile spinors decouple. 


Majorana masses have several advantages: they are not forbidden by any unbroken 
gauge symmetry, the active Majorana masses can be naturally small by the seesaw 
mechanism, and they may be connected to the leptogenesis mechanism for the baryon 
asymmetry. However, Dirac masses cannot be excluded: Majorana masses can be 
forbidden in a field theory by a conserved global lepton number symmetry,*! there are 
a number of possible mechanisms for small Dirac masses, and there are alternative 
mechanisms for the baryon asymmetry (Chapter 10). 


— Why are the masses so small compared with those of the quarks and charged leptons? 


One possibility is that the neutrinos masses are associated with higher-dimensional 
operators in the fundamental or effective low-energy theory, i.e., they are suppressed 
by powers of S/M, where M is a large new-physics scale and S$ < M. Usually, S 
is associated with a symmetry breaking scale, such as the SU(2)-breaking scale for 
the seesaw model or the scale of a new symmetry that forbids the lowest-order Dirac 
Yukawa coupling. The latter possibility generally implies the existence of new physics 
around S. 


Another mechanism involves exponential suppressions. These can occur, for example, 
in some superstring theories in which underlying U(1) gauge symmetries are broken at 
the string scale. These can re-emerge as global symmetries at the perturbative level of 
the low-energy theory, broken by exponentially-small D instantons (e.g., Blumenhagen 
et al., 2009b; Cveti¢ and Halverson, 2011). 


Other possibilities include small wave function overlaps in extra dimensions, couplings 
that only occur at loop level, and anthropic arguments (Tegmark et al., 2005). 


— Why are the masses (or at least the mass-squared differences) in a hierarchical structure? 


Mass hierarchies may arise from the Froggatt-Nielsen mechanism (Froggatt and 
Nielsen, 1979), i.e., different elements of a Dirac or Majorana mass matrix are sup- 
pressed by different powers of a discrete or continuous symmetry-breaking parameter. 
This parameter (or spurion) is often regarded as the vacuum expectation value of 
a scalar field known as a flavon.”? An alternative possibility involves wave function 
overlaps or particle locations in extra dimensions. For example, three-point vertices 
in intersecting brane string constructions can be large for particles close to each other 
or exponentially suppressed if they are well separated. Both possibilities are similar 


30There could also be a distinction if there are BSM interactions. 

311t is generally believed that conserved global symmetries are not possible in string theory or other theo- 
ries of quantum gravity (Banks and Dixon, 1988; Witten, 2001; Banks and Seiberg, 2011). However, models 
descended from string constructions may suppress Majorana masses because of underlying symmetries and 
selection rules. In particular, gravitationally-induced Majorana mass terms for the active neutrinos may be 
of O(v?/Mp) < 1075 eV, and perhaps very much smaller, i.e., they may be too small to be relevant. 

32 Another possibility is that the Yukawa couplings are actually the vacuum expectation values of a number 
of flavons, determined by the minimization of a potential (Alonso et al., 2013b). 
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in spirit to (and can occur simultaneously with) mechanisms for the small overall 
neutrino mass scale mentioned above, though they may differ in detail. 


In fact, the neutrino mass hierarchies are not particularly large compared to those 
for the quarks and charged leptons (|Am3, /Am3,|1/? ~ 5.8), and it is possible that 
their masses are essentially random numbers (anarchy) (Hall et al., 2000; de Gouvéa 
and Murayama, 2015; Lu and Murayama, 2014). An anarchical spectrum could be 
associated with superstring theory, which may have an enormous landscape of possible 
vacua, each with different values of the parameters. 


— Why do the leptonic mixings have the observed pattern, and are they related to the 
(apparently different) pattern of quark mixings? 


The large mass hierarchies and small mixing angles in the quark sector suggest that 
the two are closely related, e.g., they may both be associated with different orders of 
family symmetry breaking, or with geometrical effects in extra dimensions. 


The situation is less clear for the neutrinos, for which there are two large and one small 
mixing angle, and only moderate hierarchies. Most theoretical work assumes that the 
structure of the leptonic mixing can be described by an underlying (probably broken) 
continuous or discrete symmetry, and in many but not all models the same symmetries 
account for the mass hierarchies. However, the observations are also consistent with 
random (anarchical) values for the parameters. 


The general issues of the fermion spectrum and mixings are further discussed in Chap- 
ter 10. 


— Is there leptonic CP violation? 


CP violation in the quark sector was surprising when it was first observed, and it 
leads to the question of whether similar effects are present in the leptonic sector. CP 
violation is also necessary for leptogenesis, though not necessarily in the PMNS matrix. 
With the benefit of hindsight, however, CP breaking might have been anticipated: 
many parameters in field theory are complex numbers, and C'P violation is expected 
if the system is sufficiently complicated that the complex phases cannot all be absorbed 
by field redefinitions. It is now understood that no such phases are possible in QCD,°° 
QED, or in the weak neutral current. However, there can be observable phases in the 
quark and lepton mixings. Almost any model of neutrino masses and mixings can lead 
to CP violation unless CP is explicitly imposed. 


We now turn to specific classes of models. 


9.5.2 Dirac Masses 


There are several possibilities for suppressing h, in the Yukawa interaction 
—Lyuk = V2h, (Z.dun + Prdttr) (9.105) 
(cf., Equation 8.13 on page 260) to obtain a small Dirac mass. Usually one assumes that h, 


is forced to vanish at tree-level by some additional symmetry, such as an extra U(1)’ gauge 
symmetry and/or due to an underlying superstring construction. It is possible, however, 


33 However, the small value of the strong CP parameter discussed in Chapter 10 is not understood. 
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that higher-dimensional operators of the form 


—Lupo = V2 7 (o8l.dun + oP onder) (9.106) 
are allowed by the symmetries (e.g., Cleaver et al., 1998; Langacker, 2012), where ¢g is some 
SM-singlet spin-0 field and M is a new physics scale, such as the Planck mass. This is illus- 
trated for p = 1 in Figure 9.1. If ds acquires a VEV (¢s) = S, an effective Yukawa coupling 
hvas = Up(S/M)? will be generated, which can be very small for S < M. For example, 
Tp =1, M = Mp, p = 1, and §S at an intermediate scale ~ 10’ GeV yields m, ~ 0.1 eV. 
Another possibility is that h,,,, is only generated at loop level, e.g., in a theory with an 
extended Higgs sector, or in a supersymmetric theory with heavy exotic quarks (Masiero 
et al., 1986) or with non-holomorphic soft scalar interactions (Demir et al., 2008). Small 
Dirac masses may also emerge in higher-dimensional theories, e.g., in which vz is confined 
to our 4-d brane, but vp is free to propagate in the extra dimensions, in which case h, 
is determined by the (possibly very small) overlap of their wave functions (Dienes et al., 
1999b; Arkani-Hamed et al., 2002b). Related string-inspired possibilities involve large inter- 
section areas in intersecting brane constructions (Blumenhagen et al., 2005, 2007b) or string 
instantons (Cvetié and Langacker, 2008), both of which yield exponential suppressions.*4 

It is possible that a dominant Dirac mass term is perturbed by a much smaller Majo- 
rana mass term for the active and/or sterile spinor. This is the pseudo-Dirac case mentioned 
on page 380. It implies that the Dirac neutrino splits into two Majorana neutrinos, each 
approximately half active and half sterile, with a small mass difference given by the perturb- 
ing Majorana mass. However, this would lead to an additional mass-squared difference that 
would have been observed by solar neutrinos oscillating into the sterile component unless 
the Majorana mass is smaller than O(10~° eV) (de Gouvêa et al., 2009). 


9.5.3. Majorana Masses 


Majorana masses for active neutrinos violate weak isospin by one unit. A simple way to 
generate them at the renormalizable level is to extend the SM by the addition of a complex 
Higgs triplet $r = (6%, op, bp )*, with tr = 1, and yr = —1. The Yukawa couplings of 
or to the leptons are 


1. 4 
=L = ght LLT- or la + h.c. 


B 9.107) 
eee E (A) S ( 
= zhr (Pr (ge ae + h.c., 


with @, = ¢3, ¢9 = oa (or — id), and gp = (Or + i¢?.), and where i is de- 
fined in (9.1) on page 374. This yields a Majorana mass mr = —hrvg, if ¢% acquires 
a VEV v4,/ V2, as illustrated in Figure 9.3. The constraints on hp are less stringent 
than those for a Dirac mass since one can have |vg,| < v ~ 246 GeV (the parameter 
po = Mj, /(MZ cos? 0w) requires |vy,.| £ O(10~?)v (Problem 8.1)). The original version 
of the model (Gelmini and Roncadelli, 1981; Georgi et al., 1981) involved a global lepton 
number symmetry (the coupling in (9.107) is invariant since dr can be defined to have 
L = 2). Therefore, the spontaneous breaking implies a massless Goldstone boson, known 
as the triplet Majoron. Astrophysical constraints (see, e.g., Raffelt, 1999) on stellar cooling 


Cc 
—VR 


34For a review of superstring phenomenology, see (Ibáñez and Uranga, 2012). Implications of string theory 
for neutrino mass are reviewed in (Langacker, 2012). 
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from Majoron emission required |vg,| < O(10) keV, consistent with small neutrino mass. 
However, there was no special reason for V, to actually be that small. The triplet Majoron 
model was eventually excluded by the invisible Z width, as already discussed below (8.170) 
on page 309, because the decay into a Majoron and light scalar would have a partial width 
equal to that of two additional active neutrinos. 

The triplet model could survive, however, by adding a coupling 


—Leoor z Voor =h PF. or ot he. (9.108) 


between ¢r and the Higgs doublet ¢, where « has dimensions of mass. The clash between 
(9.108) and (9.107) implies that the lepton number is explicitly violated, yielding a mass 
for the Majoron that can be taken large enough to evade observational bounds. A currently 
popular version (the type IT seesaw (Ma and Sarkar, 1998; Hambye et al., 2001)) includes 
a mass term lu2gi - or for the triplet, where u2 is very large and positive and associated 
with a new physics scale. Substituting? 6° + v/v2 in (9.108), Vs. becomes linear in ¢%, 
so that it is forced to acquire a VEV 


Kv? kv? 
V, =- 5 mT = hr —. (9.109) 
or a Lp 


This is very small in the seesaw limit, 17./K > v. 

Instead of a Higgs triplet, a Majorana mass for active neutrinos can be generated by the 
Weinberg operator (Weinberg, 1980), which is a higher-dimensional operator? in which two 
Higgs doublets are combined as an isospin triplet, as illustrated in Figure 9.3. The form of 
the interactions can be obtained from (9.107) if we identify ¢/, with ¢'r‘¢, where the Higgs 
doublet ¢ and its tilde form ¢ are ¢ = (¢+ $°)? and ¢ = (¢°t — ¢~)* as defined in (8.14) 
on page 260. Then, 


Li £ C i (#78) + h.c. = £ (4 3) (o a) + h.c. 


Cex oto- OF got N (9.110) 
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where M is the relevant new physics scale and C is a coefficient that can be absorbed into M 
if desired. The second form is obtained using the SU (2) Fierz identity in Problem 1.1. (The 
second term in the Fierz identity vanishes for a single Higgs doublet.) Thus, the Majorana 
mass is mp = —Cv?/M. For example, M ~ 1019 GeV (the Planck scale) and C ~ 1 implies 
mr ~ 10~° eV. Log describes an effective theory that can be generated by many different 
underlying models, including the type II seesaw mentioned above. It is easily generalized 
to three families, and is considered by many to be the favored description of small neutrino 
masses. The scale of M/C (around 10' GeV for mp ~ 0.1 eV) is suggestive of, though a 
few orders of magnitude below, a grand unification or superstring scale. 

The most familiar implementation of (9.110) is in the minimal or type I see- 
saw (Minkowski, 1977; Gell-Mann et al., 1979; Yanagida, 1979; Schechter and Valle, 1980), 


Me 


35Qne should in principle minimize V w.r.t v and Vg, simultaneously. In practice the modification to v 
from the coupling in (9.108) is usually negligible in the seesaw limit, and one can always adjust the other 
Higgs parameters to maintain the observed value of v. 

36The Weinberg operator, with dimension k = 5 (i.e., the coefficient has mass dimension 4 — k, as in 
Section 2.13), is the lowest-dimensional extension of the standard model (with no vp) that is consistent 
with the gauge and Lorentz symmetries. Higher-dimensional SM operators relevant to neutrino mass are 
classified in (Babu and Leung, 2001; de Gouvéa and Jenkins, 2008). 
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in which the active neutrinos mix with heavy sterile Majorana neutrinos,®” as in (9.25) or 
(9.41) and Figure 9.3, leading to Majorana masses of O(m?,/mg). In some versions there 
is a spontaneously broken global lepton number, leading to a very weakly coupled Gold- 
stone boson, the singlet Majoron (Chikashige et al., 1981). Type I seesaw models have been 
constructed with many different scales,’ depending on mp. For example, choosing mp 
comparable to the electron mass and m, ~ 0.1 eV implies mg ~ few TeV. However, most 
popular are those based on grand unified theories, where the masses in the Dirac matrix 
Mp are typically of O(mu,c,z). The sterile masses in Ms may be generated by large Higgs 
multiplets (e.g., the 126 of SO(10)) or by higher-dimensional operators, and must typically 
be several orders of magnitude below the grand unification scale Mx, which is > 101° GeV 
in the supersymmetric case (Section 10.4). Such constructions are usually combined with 
family symmetries that restrict and relate the elements of Mp, Mgs, and the quark and 
charged lepton mass matrices, and sometimes combine the type I and II seesaws (using 
the Higgs triplets in the 126). Generating the large observed leptonic mixings in the same 
GUT context that yields small quark mixings is a challenge that requires carefully chosen 
family symmetries and/or non-symmetric (lopsided) mass matrices (Albright et al., 1998). 
The latter leads to large mixing in the unobservable A% and in the observable A$ mixing 
matrices. One especially attractive aspect of the minimal seesaw is that it opens the possi- 
bility of leptogenesis (Fukugita and Yanagida, 1986; Davidson et al., 2008; Hambye, 2012; 
Fong et al., 2012) for generating the observed baryon asymmetry of the Universe (Section 
10.1). 

Most of these GUT type models are hard to embed in known classes of superstring 
constructions, where any underlying grand unification is frequently broken in the higher- 
dimensional theory, and in any case it is difficult or impossible to obtain large representations 
like the 126. More promising is that the Majorana mass terms in Ms are generated by higher- 
dimensional operators in the superpotential?’ of the form (¢%°'/M%) Pvp, in which q+1 
SM singlet fields és (which need not be the same) acquire VEVs somewhat below the Planck 
scale Mp. This mechanism can lead to a sufficiently suppressed Mgs, but is likely to lose the 
GUT and family symmetry structure of the SO(10) models. String constraints sometimes 
forbid the simultaneous existence of such operators and those needed for Mp (Giedt et al., 
2005). However, there are a few successful examples (Lebedev et al., 2008) in which both Mp 
and Ms emerge at high order and there are O(100) sterile neutrinos. Other possibilities for 
generating Ms involve higher-dimensional operators in the Kahler potential (Arkani-Hamed 
et al., 2001b) or string instanton effects (Blumenhagen et al., 2007a; Ibáñez and Uranga, 
2007). 

Seesaw models with lower scales have also been considered. For example, in some gauge 
extensions of the SM (e.g., with an extra U(1)’ (Kang et al., 2005)) the “sterile” neutrinos 
are charged under the extended group, so any Majorana masses for them cannot be much 
larger than the symmetry breaking scale. There are also various extended seesaw models, 
which involve additional SU (2) singlets. For example, in the double seesaw (Mohapatra and 
Valle, 1986) one introduces a sterile ng with a Majorana mass u and a Dirac mass Mp 


37In the type IIT seesaw (e.g., Ma, 2009) the sterile neutrino is replaced by an SU(2) triplet. 

38The heavy sterile neutrino aggravates the Higgs/hierarchy problem (Section 10.1) unless mg < 107 
GeV (Vissani, 1998). It also tends to destabilize the electroweak vacuum (Elias-Miro et al., 2012). The 
lifetime was estimated to be sufficiently long for mg < 1013 — 1014 GeV, but see the caveats in Section 8.5.3. 

39The Weinberg operator could also emerge directly in a superstring construction. However, C/M would 
typically be ~ 1/Mp, which is too small. This difficulty could be avoided if some of the extra dimensions 
are large compared to the inverse of the string scale (Conlon and Cremades, 2007; Cvetié et al., 2010). 
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coupling to vp: 


1 0 mp 0 VR 
z= 3 ( DL D ng ) mp 0 Mp Ve | thc. (9.111) 
0 Mp H nR 


where we have taken the other masses to vanish. For u = 0 there is one massless Weyl 
neutrino and one Dirac neutrino, given approximately by vz and nz + vr respectively for 
Mp > mp. For y < mp « Mp the vy acquires a Majorana mass m, ~ pu(mp/Mp)?. 
This inverse seesaw is actually driven by a small Majorana sterile mass u rather than a 
large one, e.g., = 1 keV, mp = 100 GeV, and Mp = 10 TeV corresponds to m, ~ 0.1 
eV. This model generalizes to three families. Variations of the double seesaw with larger p 
and related models with small Dirac masses are also possible. Low-scale models may also 
have implications for LHC physics. They are reviewed in (Chen and Huang, 2011; Boucenna 
et al., 2014). 

We briefly mention two other classes of models that lead to Majorana masses. One 
involves masses that are only induced at loop level, e.g., associated with an extended scalar 
sector. In one well-known example (Zee, 1980) an SU(2)-singlet charged scalar field A7 
is introduced that couples to both leptons and Higgs doublets, leading to a loop-induced 
Majorana mass as shown in Figure 9.3 on page 377. This example actually leads to off- 
diagonal masses such as Pez V p, i.e, the ZKM model of (9.44) on page 383, and also requires 
a second Higgs doublet, because the h~ coupling is antisymmetric in lepton and Higgs 
family indices. Another possibility occurs in supersymmetric theories with R-parity violation 
(Section 10.2), which allows mixing and a type of seesaw between active neutrinos and 
neutralinos. The mixing can generate one small neutrino mass at tree level, with the other 
two masses entering at loop level (e.g., Grossman and Rakshit, 2004; Rakshit, 2004). 


9.5.4 Mixed Mass Models 


As discussed in Section 9.3.4 there are several experimental suggestions of possible mixing 
between active and light sterile neutrinos. Neutrino oscillations conserve helicity, so the 
mixing must be between states of the same chirality. Most extensions of the standard model 
involving neutrino mass introduce sterile neutrinos, which could in principle have mass at 
any scale. However, generating significant mixing is more difficult. For example, in the model 


with one active and one sterile neutrino in (9.18) on page 378 the appropriate v} — v°° or 


vie —v, mixing only occurs in the general case in which both Dirac and Majorana mass terms 
are present. Furthermore, for eV-scale sterile neutrinos with small mixing both the Dirac and 
Majorana mass terms must be tiny and not too different in magnitude. Similar statements 
apply to the multi-family case in (9.37). More generally, LSND-type active-sterile mixing 
requires two distinct types of mass terms to be simultaneously extremely small. These could 
be Majorana and Dirac, or, alternatively, two distinct types of Dirac masses, such as one 
connecting active and sterile states, and another connecting distinct steriles. Confirmation 
of such mixing would therefore require a major change in the paradigm, especially from the 
usual seesaw model. 

There have been many models for such mixing, including higher-dimensional operators 
associated with an intermediate scale, sterile neutrinos from a mirror world, supersymmetry, 
extra dimensions, and dynamical symmetry breaking (see, e.g., Langacker, 2012, for a list 
of references). Especially promising is the (minimal) mini-seesaw model. This is just the 
ordinary seesaw model in (9.25) or in (9.41) (with mr or Mr = 0), except that all of the 
Dirac and Majorana masses are assumed to be very small, e.g., < O(eV). The minimal 
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mini-seesaw has the advantage that the masses and mixings are related in a way that is 
roughly consistent with the LSND and other observations. For example, for one family one 


has 
2 


Jmil~ 2P, ma~ms, jo~ Z (9.112) 
ms ms 

from (9.25), e.g., |mi| ~ 0.04 eV and |8| ~ 0.2 for mp = 0.2 eV and mg = 1 eV. There 
have been extensive studies of the minimal mini-seesaw for two or three families*® (e.g., 
de Gouvéa and Huang, 2012; Donini et al., 2012), yielding qualitative agreement with the 
data. The minimal mini-seesaw predicts mgg = 0 for the effective 66, mass in (9.60), 
because for mass eigenvalues < MeV it is just the (vanishing) (e,e) element of the mass 
matrix in (9.37). 

The minimal mini-seesaw model parametrizes the mixing but does not explain the small 
values for mp and mg. Just as in the discussion of small Dirac masses, it suggests that 
both the Dirac Yukawa couplings and mg are forbidden at the renormalizable level by some 
new discrete, global, or gauge symmetry, and only generated by symmetry-breaking (and 
possibly loop) corrections (e.g., Langacker, 1998, 2012; Sayre et al., 2005). For example, 
consider the case of one active and one sterile neutrino in which the lowest-order allowed 
mass terms are 


q+1 r—-1 


s SS vie + Ur os CO ; (#78) + h.c., (9.113) 
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rather than those in (9.105), (9.9) and (9.110). The new symmetry is broken by (ds) = S, 
and M is a new physics scale.*! Neglecting the last term, (9.113) yields a minimal mini- 
seesaw for p,q > 1. Taking p = q = 1 and Tp = I's = 1, for example, one finds mp = Sv/M 
and mg = S?/M, so that |0| ~ v/S and |mi| ~ v?/M. Comparing with the numerical 
example above, one expects S in the TeV range and S/M ~ 10~!”. In general, one should 
not neglect the last term in (9.113). Any multiplicative symmetry of Lmini that allows the 
first two terms will also allow the third, with r = 2p — q. This corresponds to r = 1 in the 
p = q = 1 case, i.e, the new physics should be able to generate a contribution to m; directly 
that is of the same order as the mixing-induced term. This would modify the details of the 
minimal mini-seesaw model (and would allow mgg # 0), but the general idea remains. 


9.5.5 Textures and Family Symmetries 


There are also many texture models, involving specific ansatze about the form of the 3 x 3 
neutrino mass matrix, or of the Dirac and Majorana mass matrices entering seesaw models. 
These are often studied in connection with models also involving quark and charged-lepton 
mass matrices, such as family symmetries, left-right symmetry, or grand unification (Chap- 
ter 10). A major complication is that the form of a mass matrix depends on the basis 
chosen, e.g., whether the charged-lepton mass matrix is diagonal, and any underlying fam- 
ily symmetries may take a different form depending on the basis. Family symmetries may 
be continuous (global or gauged) or discrete. There has been considerable recent interest 
in discrete symmetries (for reviews, see Altarelli and Feruglio, 2010; Ishimori et al., 2010; 
King et al., 2014) because they more naturally lead to the large mixing angles observed 


40From (9.40) the leading approximation to the active-sterile mixing for both the ordinary type I and 
mini seesaws is —MpMs,". Mp can be expressed in terms of the mass eigenvalues and PMNS matrix up 
to a complex orthogonal matrix (Casas and Ibarra, 2001). 

41 An explicit example of an ultraviolet completion, i.e., a renormalizable model of the underlying physics 
that leads to the effective operators in Lmini, is given in (Heeck and Zhang, 2013). 
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in the neutrino sector. Also, they are free of the unwanted Goldstone bosons associated 
with spontaneously broken global continuous symmetries. Family symmetries are in prin- 
ciple symmetries of the underlying Lagrangian, but in practice the models are often very 
complicated, so studies are often restricted to the symmetry and its breaking pattern. 

The observed values of 623 ~ 7/4 and 013 ~ 0 suggest that the leptonic mixing matrix 
is close to 


C12 $12 A 
v=v=|-2 2 2 (9.114) 
$12 C12 1 


v2 v2 v2 
up to field redefinitions and possible Majorana phases that are irrelevant for oscillations. The 
motivation for (9.114) is weakened by the observation of nonzero 613 and likely deviation of 
023 from maximal mixing, but it may still be useful as a starting point in model building. 
The special case of sî, = 1/3, known as tri-bimaximal mizing (Harrison et al., 2002; Ma, 
2004), is consistent with but slightly above the current value in (9.103). Alternatively, the 
golden ratio prediction s2, = i ~ 0.276 (Kajiyama et al., 2007; Feruglio and Paris, 2011; 


Ding et al., 2012), is slightly below. Bimazimal mixing, cj, = st, = 1/2 was also at one 


time a serious possibility. These and similar patterns may be associated with underlying 
permutation symmetries, such as $4, or groups of even permutations, such as Ay or (for the 
golden ratio) the icosahedral group As. 

In practice, small deviations from any specific form such as tri-bimaximal mixing are 
possible. For example, they may be associated with broken discrete symmetries, and/or 
they could apply only to the neutrino mixing, with additional small mixing comparable to 
the CKM angles induced by the charged leptons [Cabibbo haze (Datta et al., 2005; Everett, 
2006)]. It has been observed that 812 is close to the quark-lepton complementarity (Raidal, 
2004; Minakata and Smirnov, 2004) value 612 + 0. = 7/4, where 6. is the Cabibbo angle, 
reopening the possibility of bimaximal mixing in the neutrino sector. Similarly, 013 is close 
to the value 6./ /2 that has been motivated in several schemes (Minakata and Smirnov, 
2004; Antusch, 2013). Renormalization group evolution of couplings is another complication, 
since one might expect underlying symmetries to apply at a large GUT or string scale rather 
than at low energies, and mass degeneracies (in sign as well as magnitude) at a high scale 
may be destabilized by the running (e.g., Antusch et al., 2005). 

In most quark texture models the small mixing angles are associated with the small ratios 
of mass eigenvalues, as in Problem 9.7. Two of the neutrino mixings are large, however, 
suggesting the possibility that the mixings and mass eigenvalues are decoupled. This can 
easily occur. In particular, any Dirac, triplet Majorana, or seesaw-induced 3 x 3 neutrino 
mass matrix of the form 


A B -B 
M,=| B C -D (9.115) 
-B -D C 


will lead (e.g., Altarelli and Feruglio, 2010) to the leptonic mixing matrix in (9.114), corre- 
sponding to §13 = 0 and 823 = C23 = 1/V2, with 


8B? 


in? 2012 = 9.116 
Sn 4012 = TA— (C + D)? + 8B? ele) 

The mass eigenvalues are related by 
A = Êa mı + SipMa2, C+D = sfo mı + a Mo, C — D = ms, (9.117) 


which because of the phase conventions chosen in (9.114) may be negative or complex. 
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Clearly, the parameters can be chosen to yield any of the types of mass hierarchy. Tri- 
bimaximal mixing occurs whenever 


A+B=C+4+D, (9.118) 
implying 
(9.119) 
mı =A-—B, My = A+ 2B, m3 =C -— D. 


Bimaximal mixing corresponds to 


A=C+D, (9.120) 
so that R 
=e. P mı 2 = AF V2B, mz =C—D. (9.121) 
The golden ratio occurs for 
A=C+D-V2B. (9.122) 


The form of (9.114) and (9.115) exhibits an obvious v,, + v, interchange symmetry, up 
to signs depending on our phase convention (Xing and Zhao, 2016). 


9.6 IMPLICATIONS OF NEUTRINO MASS 


Most extensions of the SM predict non-zero neutrino masses at some level, so it is difficult 
to determine their origin. Many of the promising mechanisms, such as the minimal seesaw, 
involve very short distance scales, e.g., associated with grand unification or string theories, 
and are therefore difficult to verify directly. Some models lead to other types of predictions. 
For example, lepton flavor violating processes’ like u > ey, uN —> eN, or u > 3e, by loop 
effects involving sneutrinos (Č) and other superpartners in supersymmetry (e.g., Hisano 
et al., 1996; Casas and Ibarra, 2001; Masiero et al., 2004; de Gouvéa and Vogel, 2013) may 
be observable, but their connection to the neutrino mass generation mechanism is model 
dependent. Lepton-flavor violating effects or direct production of heavy Majorana neutrinos 
at hadron colliders (e.g., leading to same-sign dileptons) may also be observable in TeV-scale 
models for neutrino mass (Cheng and Li, 1977; Atre et al., 2009; Alonso et al., 2013a). 
There are many unanswered questions. These include: 


— Are the neutrinos Dirac or Majorana? 


Majorana masses, especially if associated with a type I or II seesaw, would allow the 
possibility of leptogenesis. The observation of 389, would establish Majorana masses 
(or at least L violation), but foreseeable experiments will only be sensitive to the 
inverted or degenerate hierarchies. Dirac masses would suggest that additional TeV- 
scale symmetries or string symmetries/selection rules are forbidding Majorana mass 
terms and suppressing the Dirac Yukawa couplings. 


— What is the absolute mass scale (with implications for cosmology)? 


This is very difficult, but ordinary and double beta decay experiments, as well as 
future cosmological observations, may be able to establish the scale. 


42The nonzero neutrino masses and mixings themselves violate lepton flavor, but their effects are negligible 
except for neutrino oscillations. 
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— Is there leptonic CP violation? What is 6? Is the hierarchy normal or inverted? Is 623 
maximal? 


Leptonic CP violation is a necessary ingredient in leptogenesis. The CP phases in 
Y are different from the ones relevant to leptogenesis, though they are often related 
in specific models. For three families, the phase 6 in (9.35) may be observable, e.g., 
in differences between the v, —> Ve and D, — De oscillation rates in long-baseline 
experiments such as T2K and NOvA, especially with their intensity upgrades, or 
in future experiments with even longer baselines. Such effects are proportional to 
$13, but fortunately the nonzero value measured in reactor experiments is sufficiently 
large. They also depend on the sign of Am3, because of matter effects in the Earth. 
The nature of the hierarchy may be determined simultaneously with CP breaking in 
long-baseline experiments, in other matter effects involving atmospheric or supernova 
neutrinos, in future high precision reactor experiments, in the observation of 38o, if the 
neutrinos are Majorana, or by cosmological determinations of © = X; |m;|. The value 
of 623 will also be measured more precisely in future long-baseline D, disappearance. 
On a longer time scale, CP violation, the hierarchy, and related issues may also be 
addressed at a dedicated neutrino factory (from a muon storage ring), or in beta beams 
involving D, emission from accelerated heavy ions. 

Ultra-high energy neutrinos from violent astrophysical events can be observed in large 
detectors in ice or water, such as IceCube and KM3NeT, and in fact IceCube has al- 
ready detected a number of events in the 30 TeV-PeV range. These may possibly shed 


light on neutrino oscillations or decay, nonstandard properties, and on the astrophys- 
ical sources. 


— Are there light sterile neutrinos? 


If the LSND, MiniBooNe, reactor, and gallium results are confirmed, it will suggest 
mixing between ordinary and sterile neutrinos, presenting a serious challenge both to 
particle physics and cosmology, or imply something even more bizarre, such as CPT 
violation. 


— Are there any new v interactions or anomalous properties such as large magnetic mo- 
ments? 
Most such ideas are excluded as the dominant effect for the solar and atmospheric 
neutrinos, but could still appear as subleading effects. 


Future experimental and observational prospects relevant to these issues are reviewed 
generally in (de Gouvêa et al., 2013), and the hierarchy determination in (Cahn et al., 2013; 
Qian and Vogel, 2015; Patterson, 2015). 


9.7 PROBLEMS 
9.1 Show that (9.13) is a solution to the free Majorana field equation (9.12). 


9.2 Show that adding a small 23 element to the (symmetric) ZKM matrix in (9.45) (a 
generalization of the pseudo-Dirac case), will lead to three Majorana mass eigenvalues even 
though none of the diagonal elements is nonzero. 


9.3 Suppose the WCC in the second form in (9.49) contained a small admixture of V + A, 


Thy = ey" [(1— 7°) +e(1+7°)] vm, 
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with e < 1. Calculate the leading correction to the t-channel amplitude in (9.52) for 
W-W- — ee” and show that it survives even for mr = 0. The interpretation is that 
the amplitude for the diagrams in Figures 9.2 or 9.4 requires a chirality flip on the internal 
neutrino line, which can be due either to a Majorana mass or to a new interaction. (This 
mechanism does not occur in normal SU(2); x SU(2)r x U(1) models, because the V + A 
interactions involve vg rather than vp.) 


9.4 Show that the neutral current coupling 7, y"vz in (8.74) for an active massive neutrino 
can be written as $7p7"(1—°)vp (i.e., V — A) if it is Dirac, and as —$7y"7°vm (pure 
axial) if it is Majorana, where vp = vz, + vr and vm = Vr +Vp. 


9.5 Suppose there existed a fourth stable active neutrino with mass m, in the GeV range 
but small compared with Mz/2 (this is now excluded by the measurements of the invisible 
Z width, but was once considered an interesting possibility). Similar to the discussion 
in Problem 5.5, the relic density of such neutrinos depends on the thermal average of 
G(vpV¥p —> X)ßrei if the neutrino is Dirac, or o(vmvm > X) Bret if it is Majorana, where 
Bret is the relative velocity, X represents a sum over accessible final states, the energy is 
at or just above threshold, s = 4m2, and one averages over the (thermalized) initial spins. 
Calculate the leading contribution to re; in the CM via the s-channel Z annihilation 
diagram at threshold in each case, where X = ff is a light fermion with neutral current 
couplings given by (8.75). Show that it is S-wave (constant) for the Dirac case and P- 
wave (x 6? = 82/4) in the Majorana case. Neglect my. (This calculation was used in the 
derivation of the Lee-Weinberg (and others) bound (e.g., Kolb and Turner, 1990; Dolgov, 
2002), m, > 2 GeV (Dirac) or m, > 8 GeV (Majorana), to avoid overclosing the Universe, 
assuming that the neutrino is stable and heavier than O(100) eV and that Q,h? < 1.) 


9.6 Derive (9.74). 


9.7 As a toy model of a quark texture, consider the empirical Oakes relation sin ĝe ~ 
Mr/Mg (Oakes, 1969), where 0, is the Cabibbo angle. Show how this might emerge from the 
three-quark Fermi theory described in Section 7.2 if one assumes that for some reason my, ~ 
M$ ~ 0. Hint: recall the relation between the quark and pseudoscalar masses described in 
Section 5.8. 
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Beyond the Standard Model 
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10.1 THE NEED FOR NEW PHYSICS 


The structure of the electroweak part of the standard model was explored in detail in 
Chapter 8, but for convenience we summarize the Lagrangian density after spontaneous 
symmetry breaking:! 


L= Leauge + Leo + yo ( p- Mr — mE Ur 
r (10.1) 
g g u 
2/2 ( Io oo 
where the self-interactions for the W*+, Z, and y are given in (8.42) and (8.43), Ly is given 
in (8.39), and the fermion currents in (8.60), (8.70), and (8.74). 

The standard electroweak model is a mathematically-consistent renormalizable field the- 
ory that predicts or is consistent with all experimental facts. It successfully predicted the 
existence and form of the weak neutral current; the existence and masses of the W and Z 
bosons; and those of the charm quark, as necessitated by the GIM mechanism. The charged 
current weak interactions, as described by the generalized Fermi theory, were successfully 
incorporated, as was quantum electrodynamics. The consistency between theory and exper- 
iment indirectly tested the radiative corrections and ideas of renormalization and allowed 
the successful approximate prediction of the top quark and Higgs boson masses. Although 
the original formulation did not provide for massive neutrinos, they are easily incorpo- 
rated by the addition of right-handed states vp (Dirac) or as higher-dimensional operators, 
perhaps generated by an underlying seesaw (Majorana). When combined with quantum 
chromodynamics for the strong interactions, the standard model is almost certainly the 
approximately correct description of the elementary particles and their interactions down 
to at least 107-'°cm, with the possible exception of new very weakly coupled particles. 
When combined with general relativity for classical gravity the SM accounts for most of the 
observed features of Nature (though not for the dark matter and energy). 

However, the theory has far too much arbitrariness to be the final story. For example, the 
minimal version of the model has 20 free parameters for massless neutrinos and another 7 
(9) for massive Dirac (Majorana) neutrinos, not counting electric charge (i.e., hypercharge) 


Hy Wy + JWE) - eB Ay — 


2 
Lis, is given by $0. 2pLi Puri — imor (Privep + h.c.) (i + H) for Majorana vzr, masses generated 
by a higher-dimensional operator involving two factors of the Higgs doublet, as in the seesaw model. 
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assignments.” Most physicists believe that this is just too much for the fundamental theory. 
The complications of the standard model can also be described in terms of a number of 
problems. 


10.1.1 Problems with the Standard Model 
The Gauge Symmetry Problem 


The standard model is a complicated direct product of three subgroups, SU(3) x SU(2) x 
U(1), with separate gauge couplings. There is no explanation for why only the electroweak 
part is chiral and parity-violating. Similarly, the standard model incorporates but does not 
explain another fundamental fact of Nature: charge quantization, i.e., why all particles have 
charges that are multiples of e/3. This is important because it allows the electrical neutrality 
of atoms (|gp| = |qe|). The complicated gauge structure suggests the existence of some 
underlying unification of the interactions, such as one would expect in a superstring (e.g., 
Green et al., 1987; Polchinski, 1998; Becker et al., 2007; Ibáñez and Uranga, 2012) or 
grand unified theory (Section 10.3). Charge quantization can also be explained in such 
theories, though the “wrong” values of charge emerge in some constructions due to different 
hypercharge embeddings or non-canonical values of Y (e.g., some string constructions lead 
to exotic particles with charges of +e/2). Charge quantization may also be explained, at 
least in part, by the existence of magnetic monopoles (e.g., Preskill, 1984) or the absence 
of anomalies,’ but either of these is likely to find its origin in some kind of underlying 
unification. 


The Fermion Problem 


All matter under ordinary terrestrial conditions can be constructed out of the fermions 
(ve,e-,u,d) of the first family. Yet we know from laboratory studies that there are > 3 
families: (v,,, u~,c,s) and (v+, T™,t,b) are heavier copies of the first family with no obvious 
role in Nature. The standard model gives no explanation for the existence of these heavier 
families and no prediction for their numbers. Furthermore, there is no explanation or pre- 
diction of the fermion masses, which are observed to occur in a hierarchical pattern that 
varies over 5 orders of magnitude between the t quark and the e7, or of the quark and lep- 
ton mixings. Even more mysterious are the neutrinos, which are many orders of magnitude 
lighter still. It is not even certain whether the neutrino masses are Majorana or Dirac. A 
related difficulty is that while the CP violation observed in the laboratory is well accounted 
for by the phase in the CKM matrix, there is no SM source of CP breaking adequate to 
explain the baryon asymmetry of the universe. 

There are many suggestions for new physics that might shed light on these questions. 
The existence of multiple families could be due to large representations of some grand unified 
theory. Alternatively, it could be associated with different possibilities for localizing particles 
in some higher-dimensional space, which could be associated with a superstring compact- 
ification, extra-dimensional grand unification, or by some effective brane world scenario 
involving large* (Arkani-Hamed et al., 1998; Dienes et al., 1999a) and/or warped (Randall 


212 fermion masses (including the neutrinos), 6 mixing angles, 2 CP violation phases (+ 2 possible 
Majorana phases), 3 gauge couplings, My, v, 99¢p, Mp, Acosm, minus one overall mass scale since only 
mass ratios are physical. 

3The absence of anomalies is not sufficient to determine all of the Y assignments without additional 
assumptions, such as family universality. 

4See (Witten, 1996; Lykken, 1996b) for early explorations of large dimensions associated with low su- 
perstring scales, and (Berenstein, 2014) for modern developments. 
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and Sundrum, 1999) extra dimensions. See, e.g., (Hewett and Spiropulu, 2002; Gherghetta, 
2011; Ponton, 2013; Csáki and Tanedo, 2015; Patrignani, 2016) for general reviews and 
(Adelberger et al., 2009) for laboratory constraints. The hierarchies of masses and mix- 
ings could emerge from wave function overlap effects in such higher-dimensional spaces, or 
as exponential suppressions associated with intersection areas in the internal dimensions 
in intersecting brane constructions (e.g., Blumenhagen et al., 2005). Another interpreta- 
tion, also possible in string theories, is that the hierarchies are because some of the mass 
terms are generated by higher-dimensional operators and therefore suppressed by powers of 
(0|S|0)}/Mx, where S is some standard model singlet field and Mx is some large scale such 
as Mp. The allowed operators could perhaps be enforced by some family symmetry (Frog- 
gatt and Nielsen, 1979). Radiative hierarchies (e.g., Babu and Mohapatra, 1991), in which 
some of the masses are generated at the loop level, or some form of compositeness are 
other possibilities. Despite all of these ideas there is no compelling model and none of these 
yields detailed predictions. Grand unification by itself doesn’t help very much, except for 
the prediction of mp in terms of m, in the simplest versions. 

As discussed in Chapter 9 the small values for the neutrino masses suggest that they 
are associated with Planck or grand unification physics, as in the seesaw model, but there 
are other possibilities. 

Almost any type of new physics is likely to lead to new sources of CP violation. 


The Higgs/Hierarchy Problem 


In the standard model one introduces an elementary Higgs field to generate masses for the 
W, Z, and fermions. The observed mass My ~ 125 GeV of the physical Higgs boson is of 
the same order as the W mass. In fact, this is required by consistency for a weakly coupled 
theory: if My, had been orders of magnitude larger than My the Higgs self-interactions 
would have been excessively strong. This is manifested by a theoretical upper limit of 
O(700 GeV) on the mass of an elementary Higgs (see Section 8.5.1). 

However, there is a complication. The tree-level (bare) Higgs mass receives quadratically- 
divergent corrections from loop diagrams such as those in Figure 10.1. One finds (Sec- 
tion 10.2.1) 

Mẹ, = (Mẹ )bare + O(A, 97, h?)A?, (10.2) 


where A is the next higher scale in the theory. If there were no higher scale, one could simply 
interpret A as an ultraviolet cutoff and take the view that My is a measured parameter, 
with (Mz)pare not observable. However, the theory is presumably embedded in some larger 
theory that cuts off the momentum integral at the finite scale of the new physics.” For 
example, if the next scale is gravity, A is the Planck scale Mp = Ge 2 ~ 10!9 GeV. In 
a grand unified theory, one would expect A to be of order the unification scale Mx ~ 
1014 GeV. Hence, the natural scale for My is O(A), which is much larger than the expected 
value. There must be a fine-tuned and apparently highly contrived cancellation between the 
bare value and the correction, to more than 30 decimal places in the case of gravity. If the 
cutoff is provided by a grand unified theory there is a separate hierarchy problem at the 
tree-level. The tree-level couplings between the Higgs field and the superheavy fields lead 
to the expectation that My is close to the unification scale unless unnatural fine-tunings 
are done, i.e., one does not understand why (My /Mx)? is so small in the first place. 

The naturalness paradigm (e.g., Giudice, 2008; Feng, 2013; Dine, 2015a), i.e., that the 


5 As discussed in Section 2.12.2, there is no analogous fine-tuning associated with logarithmic divergences. 
In QED, for example, corrections are of O(aln(A/me_-)), which is < 1 even for A = Mp. Similarly, from 
(5.38) the QCD coupling can become strong at O(1 GeV) without excessive tuning, e.g., for as(Mp) ~ 0.02. 
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Figure 10.1 Radiative corrections to the Higgs mass, including self-interactions, in- 
teractions with gauge bosons, and interactions with fermions. 


qualitative features of Nature should not depend on fine-tuned cancellations or on very 
large or small dimensionless ratios, has often served as a guide to possible new physics. For 
example, the observed value of the Kz — Kg mass difference motivated the prediction of the 
charm quark (the GIM mechanism), which enforced the exact cancellation between much 
larger contributions and allowed an approximate prediction of me (Section 8.6.3). 

Most of the solutions to the Higgs/hierarchy problem are similarly motivated by natu- 
ralness.° In TeV scale supersymmetry, for example, the quadratically-divergent contribu- 
tions of fermion and boson loops cancel, leaving only much smaller effects of the order of 
supersymmetry-breaking. (However, supersymmetric grand unified theories still suffer from 
the tree-level hierarchy problem.) There are also (non-supersymmetric) extended models in 
which there are cancellations between bosons or between fermions, at least up to a Higgs 
compositeness scale (Section 8.5.3). This class includes Little Higgs models (Arkani-Hamed 
et al., 2002a), in which the (composite) Higgs is forced to be lighter than new TeV scale 
dynamics because it is a pseudo-Goldstone boson of an approximate underlying global sym- 
metry, and Twin-Higgs models (Chacko et al., 2006). For a review, see (Csáki et al., 2016). 

Large and/or warped extra dimensions can also resolve the difficulties by providing a 
cutoff at a fundamental scale Mr much lower than the Planck scale. The suppression of 
Mr/Mp may be associated with some (possibly fractional) inverse power of the size of a 
large extra dimension compared to Me (which, however, introduces its own naturalness 
problem), or may involve an exponential suppression in a gravitationally-warped dimension. 
Deconstruction models, in which no extra dimensions are explicity introduced (Arkani- 
Hamed et al., 2001a; Hill et al., 2001), are closely related. 

Most of the models mentioned above have the potential to generate flavor changing 
neutral current effects and EDMs much larger than observational limits. Pushing the mass 
scales high enough to avoid these problems may conflict with a natural solution to the hier- 
archy problem, i.e., one may reintroduce a little hierarchy problem. Many are also strongly 
constrained by precision electroweak physics. In some cases the new physics does not sat- 
isfy the decoupling theorem (Appelquist and Carazzone, 1975), leading to large oblique 
corrections (Section 8.3.6). In others new tree-level effects may again force the scale to be 
too high. The most successful from the precision electroweak point of view are those which 


SOf course, there was no Higgs/hierarchy problem in Higgsless models or models with dynamical sym- 
metry breaking (Section 8.5.3), but the observation of the Higgs forces us to take it seriously. 
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have a discrete symmetry that prevents vertices involving just one heavy particle, such as 
R-parity in supersymmetry, T-parity in some little Higgs models (Cheng and Low, 2003), 
and K K-parity in universal extra dimension models (Appelquist et al., 2001). 

A very different possibility is to accept the fine-tuning, i.e., to abandon the notion of 
naturalness’ for the weak scale, perhaps motivated by anthropic considerations (Agrawal 
et al., 1998; Hall et al., 2014). (The anthropic idea will be considered below in the discussion 
of the gravity problem.) This could emerge, for example, in split supersymmetry (Arkani- 
Hamed and Dimopoulos, 2005). 

Another (relazion) mechanism that avoids the need for new physics at the TeV scale 
will be mentioned in the next subsection. 


The Strong CP Problem 


Another fine-tuning problem is the strong CP problem (e.g., Dine, 2000; Ramond, 2004; 
Peccei, 2008; Kim and Carosi, 2010). One can add an additional term® “gop GĞ to the 
QCD Lagrangian density that breaks P, T and CP symmetry, as in (5.2) on page 160. 
Giu = €yvpaG'?? /2 is the dual field strength tensor introduced in (5.7). As mentioned in 
Section 8.6.5, such a term would induce an electric dipole moment dn for the neutron. The 
stringent limits on dn lead to the upper bound |§gcp| < 107+? — 1071. The question is, 
therefore, why is 0gc¢p so small? It is not sufficient to just say that it is zero (i.e., to impose 
CP invariance on QCD) because of the observed violation of CP by the weak interactions, 
which is believed to be associated with phases in the quark mass matrices. The quark phase 
redefinitions that remove them lead to a shift in gcp by O(10~*) because of the anomaly 
in the vertex coupling the associated global current to two gluons, as discussed following 
(5.130). An apparently contrived fine-tuning is therefore needed to cancel this correction 
against the bare value. Solutions include the possibility that CP violation is not induced 
directly by phases in the Yukawa couplings, as is usually assumed in the standard model, 
but is somehow violated spontaneously. 0gcp then would be a calculable parameter induced 
at loop level, and it is possible to make gc p sufficiently small. However, such models lead 
to difficult phenomenological and cosmological problems.? Alternatively, @gcp becomes 
unobservable (i.e., can be rotated away by the quark phase redefinition) if there is a massless 
u quark (Kaplan and Manohar, 1986). However, most phenomenological estimates (Aoki 
et al., 2017) are not consistent with mu = 0. 

Another possibility is the Peccei-Quinn mechanism (Peccei and Quinn, 1977), in which 
an extra global U(1) symmetry is imposed on the theory in such a way that 9gcp becomes a 
dynamical variable that is zero at the minimum of the potential. The spontaneous breaking 
of the symmetry, along with explicit breaking associated with the anomaly and instanton 
effects, leads to a very light pseudo-Goldstone boson known as an axion (Weinberg, 1978; 
Wilczek, 1978). It was initially assumed that the scale fa at which the U(1) symmetry 
is broken would be comparable to the electroweak scale v, but this was soon excluded 
experimentally. Still allowed is the invisible axion, involving a very large fa >> v and a very 


7Naturalness ideas will be further tested at the LHC and other upcoming experiments, and are a major 
motivation (e.g., Arkani-Hamed et al., 2016) for even higher-energy facilities, such as O(100 TeV) pp colliders 
that have been proposed at CERN and in China. 

8One could add an analogous term for the weak SU(2) group, but it does not lead to observable conse- 
quences, at least within the SM (Anselm and Johansen, 1994; Dine, 2000). 

°Models in which the CP breaks near the Planck scale may be viable (Nelson, 1984; Barr, 1984). 
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light axion, with a mass 


Ma 


\VMu/Ma frMr 6 eV (= =) l (10.3) 


E 1+ mMu/ma V2 fa ~ 


estimated using chiral SU (2)z x SU (2)r considerations. Axion couplings also scale as f+. 
Laboratory, astrophysical, and cosmological constraints suggest that fa is in the range 
10° — 101? GeV, corresponding to Ma ~ (107° — 107?) eV. 

Axions and similar axion-like particles (ALPs), which are not necessarily associated with 
the strong CP problem and for which (10.3) need not hold, often occur in superstring theo- 
ries (Svrcek and Witten, 2006). Experimental searches typically involve an electromagnetic 
coupling to Fa FH” (i.e., to E. B), analogous to the GG coupling. This can induce axion- 
photon conversions in a strong magnetic field, which can lead to resonant excitations in a 
high-Q microwave cavity. Some ALP’s might even allow “light shining through a wall”, i.e., 
conversions of a photon to an axion and then a reconversion to a photon on the opposite 
side of a barrier. Coherent oscillations of a background axion field could also possibly gener- 
ate oscillating electric dipole moments (Graham and Rajendran, 2013). Searches for axions 
and ALPs produced in the Sun, the laboratory, or from the early universe are reviewed 
in (Asztalos et al., 2006; Kim and Carosi, 2010; Jaeckel and Ringwald, 2010; Essig et al., 
2013; Graham et al., 2015a; Patrignani, 2016). Their cosmological consequences, especially 
as viable dark matter candidates, are reviewed, e.g., in (Kawasaki and Nakayama, 2013; 
Marsh, 2016). 

A novel solution to the Higgs/hierarchy problem involves coupling the Higgs to the QCD 
axion field, known in this context as the relaxion (Graham et al., 2015b). The coupling is 
such that the Higgs mass-squared changes from positive to negative as the relaxion evolves 
from an initial large value cosmologically, and then shuts the evolution off due to QCD 
effects soon after the Higgs acquires its VEV. 


The Gravity Problem 


Gravity is not fundamentally unified with the other interactions in the standard model, 
although it is possible to graft on classical general relativity!? by hand. However, general 
relativity is not a quantum theory, and there is no obvious way to generate one within the 
standard model context. Loop quantum gravity (e.g., Rovelli, 2011) focuses on the prop- 
erties of space-time itself and does not unify gravity with the other interactions. Kaluza- 
Klein (Chodos et al., 1987) and supergravity (e.g., Nilles, 1984; Wess and Bagger, 1992; 
Terning, 2006) theories connect gravity with the other interactions in a more natural way 
than the SM, but do not yield renormalizable theories of quantum gravity. More promis- 
ing are superstring theories (which may incorporate Kaluza-Klein and supergravity), which 
unify gravity and may yield finite theories of quantum gravity and all the other interac- 
tions. String theories are perhaps the most likely possibility for the underlying theory of 
particle physics and gravity, but at present there appear to be a nearly unlimited num- 
ber of possible string vacua (the landscape), with no obvious selection principle. As of this 
writing the particle physics community is still trying to come to grips with the landscape 
and its implications. Superstring theories naturally imply some form of supersymmetry, 
but the supersymmetry relevant to our sector of Nature might be manifest only at a high 


l0Tests of general relativity are reviewed in (Will, 2014). Particle physics implications of the direct 
observation of gravitational waves by the LIGO interferometers (Abbott et al., 2016) include a limit 
Mg < 1.2 x 10-2? eV on the graviton mass, and the possibility of future observations of signals from 
cosmological phase transitions. 
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scale and have nothing to do with the Higgs/hierarachy problem (split supersymmetry is a 
compromise, keeping some aspects at the TeV scale). 

In addition to the fact that gravity is not unified and not quantized there is another 
difficulty, namely the cosmological constant. The cosmological constant can be thought of as 
the energy of the vacuum. However, we saw in Section 8.2.2 that the spontaneous breaking of 
SU(2) x U(1) generates a value (0|V(v)|0) = —y*/4, for the expectation value of the Higgs 
potential at the minimum. This is a c-number which has no significance for the microscopic 
interactions. However, it assumes great importance when the theory is coupled to gravity, 
because it contributes to the cosmological constant (e.g., Bass, 2011), which becomes 


Neosm = Abare T AssB. (10.4) 


Abare = 8TG NV (0) is the primordial cosmological constant, which can be thought of as the 
value of the energy of the vacuum in the absence of spontaneous symmetry breaking (the 
definition of V (¢) in (8.12) implicitly assumed Apare = 0) and Assps is the part generated 
by the Higgs mechanism: 


lAssa| = 87G y|(0|V|0)| ~ 1056 Acs. (10.5) 


It is some 10°° times larger in magnitude than the observed value Aobs/8SnGyN ~ 
(0.0022 eV)* (assuming that the dark energy is due to a cosmological constant), and it 
is of the wrong sign. 

This is clearly unacceptable. Technically, one can solve the problem by adding a constant 
+pu*/4. to V, so that V is equal to zero at the minimum (i.e., Apare = 27G yut /A). However, 
with our current understanding there is no reason for Apare and Aggg to be related. The 
need to invoke such an incredibly fine-tuned cancellation to 56 decimal places is probably 
the most unsatisfactory feature of the standard model. 

The problem becomes even worse in superstring theories, where one expects a vacuum 
energy of O(M$) for a generic point in the landscape, leading to |Agssp| = 10! Agus. 
The situation is almost as bad in grand unified theories. Finally, any solution must deal 
with other contributions to the vacuum energy, such as zero-point energies (which cancel 
in unbroken supersymmetry) and the smaller but still very large energy associated with 
the QCD vacuum condensates that break chiral symmetry. Contributions from strings, 
electroweak breaking, QCD, and from zero point energies are associated with very different 
physics scales, but somehow they must all add up to a negligible value. 

So far no compelling solution to the cosmological constant problem has emerged. One 
intriguing possibility invokes the anthropic (environmental) principle (Barrow and Tipler, 
1986; Hogan, 2000), i.e., that a much larger or smaller value of |Acosm| would not have 
allowed the possibility for life to have evolved because the Universe would have expanded 
or recollapsed too rapidly (Weinberg, 1989b). This would be a rather meaningless argument 
unless (a) Nature somehow allows a large variety of possibilities for |Acosm| (and perhaps 
other parameters or principles), such as might occur in different vacua, and (b) there is 
some mechanism to try all or many of them. In recent years it has been suggested that 
both of these needs may be met. There appears to be an enormous landscape of possible 
superstring vacua (Bousso and Polchinski, 2000; Kachru et al., 2003; Susskind, 2003; Denef 
and Douglas, 2004; Polchinski, 2015), with no obvious physical principle to choose one over 
the other. Something like eternal inflation (Linde, 1986, 2008) could provide the means to 
sample them, leading to an enormous multiverse of regions with different physical laws and 
parameters. Only the environmentally suitable vacua would lead to long-lived Universes 
suitable for life. Perhaps some of the other fine-tunings or arbitrary features of Nature, 
such as the Higgs/hierarchy problem or the relative values of the light fermion masses (and 
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their implications for the stability of nuclei (Damour and Donoghue, 2008)), could similarly 
be associated with these ideas.'! The landscape/multiverse is highly controversial and is 
currently being heatedly debated. My own view is that if any theory really has a landscape 
of vacua then one must take the multiverse as a serious possibility, even if it is hard to test. 
Recent reviews of the multiverse include (Schellekens, 2013, 2015; Linde, 2017; Donoghue, 
2016). Such paradigms as uniqueness, naturalness, and minimality are further discussed 
in (Langacker, 2017). 


10.1.2 New Ingredients for Cosmology and Particles 


It is now clear that the standard model requires a number of new ingredients. These include: 


A Consistent Incorporation of Quantum Gravity 


Superstring theory is probably the most promising possibility. 


The Initial Conditions on the Big Bang 


The observed flatness, homogeneity, and isotropy of the Universe appear to require very 
fine-tuned initial conditions on the big bang. The fine-tuning could be avoided if there 
were an initial a period of exponentially rapid expansion known as inflation (Guth, 1981; 
Kolb and Turner, 1990; Lyth and Riotto, 1999; Linde, 2008), followed by a reheating. This 
inflation could have been driven by the energy density in a scalar field (the inflaton), most 
likely emerging from BSM physics. 


A Mechanism for the Baryon Asymmetry 


The observed excess of baryons with respect to antibaryons (e.g., Canetti et al., 2012), 
ng/ny ~ 6 x 1071, ng < npg, is presumably due to a tiny asymmetry (ng — nz)/Nq ~ 
107° of quarks compared to antiquarks in the early Universe. This asymmetry could have 
been generated dynamically if the three Sakharov conditions (Sakharov, 1967; for general 
reviews, see Bernreuther, 2002; Dine and Kusenko, 2004; Cline, 2006), (a) baryon number 
violation (to allow B-violating transitions), (b) CP violation (to distinguish q and q), and 
(c) nonequilibrium of the B-violating or other relevant processes (or CPT violation so that, 
e.g., Mg E Mg). 

Baryon number is conserved in the standard model at the perturbative level, but there 
are non-perturbative vacuum tunneling (instanton) effects (t Hooft, 1976b), which violate 
B and L but preserve B — L. These are negligible at zero temperature where they are 
exponentially suppressed (of O (exp [—27 sin? Oy /a] ~ 1078°)), but important at high tem- 
peratures due to thermal fluctuations (sphaleron configurations) (Klinkhamer and Manton, 
1984; Kuzmin et al., 1985). This nonperturbative B violation would not have been enough 
to generate the asymmetry in the SM, however, because the only available candidate for 
non-equilibrium would be a first-order electroweak phase transition (i.e., the cosmologi- 
cal transition at critical temperature Te = O(v) from a high temperature phase in which 
SU (2) x U(1) is unbroken to the broken phase at lower temperature), and this would only 
have been sufficiently strong for a Higgs mass < 35 GeV, far below the observed value. 


11A good analogy is the relative radii of the planetary orbits, once thought by Kepler to have an elegant 
geometric explanation involving the nesting of Platonic solids, whereas we now understand that they are 
an accident of the initial conditions of the solar system. Similarly, the “lucky accident” that the Earth is in 
the Sun’s habitable zone simply reflects that with many stars and many planets some will be “just right”. 
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Furthermore, the CP violation from the CKM and PMNS matrices (or from @gcp) is too 
weak. 

There have been a number of suggestions for explaining the baryon asymmetry in BSM 
scenarios, however. One involves the out of equilibrium decays of superheavy Majorana 
right-handed neutrinos [leptogenesis (Fukugita and Yanagida, 1986; Davidson et al., 2008; 
Hambye, 2012; Fong et al., 2012)], as found in the minimal seesaw model (Section 9.5). 
The decays are expected to occur at a temperature T that is small compared to the heavy 
Majorana mass (i.e., out of equilibirium), but still large compared to T., so that SU(2) xU(1) 
breaking can be ignored. If CP were exact, a heavy Majorana neutrino vm could decay to 
ber (i.e., d°vz_ or otez) or to ot lh (ie, d°'V% or d~ eR) with equal rates. Here, ¢ is the 
Higgs doublet and £z a lepton doublet, as in (8.13) on page 260, and we have suppressed 
flavor indices. However, there will in general be additional CP phases in the mass and 
Yukawa matrices that are not directly observable in the PMNS matrix. The interference 
between tree and loop diagrams could then lead to a rate difference T(vm —> o! lh) > 
T(vm —> ¢£L) and a corresponding lepton asymmetry, ne < nec. The latter could then 
be partially converted to a baryon asymmetry by sphaleron processes. This leptogenesis 
scenario and more complicated versions are very attractive, but in practice a number of 
constraints must be satisfied to generate the observed asymmetry consistent with experiment 
and cosmology. 

Another possibility assumes the existence of a strongly first-order electroweak phase 
transition [electroweak baryogenesis (e.g., Trodden, 1999; Morrissey and Ramsey-Musolf, 
2012)]. A first-order transition would occur by the nucleation of bubbles of “true” (SU (2) x 
U(1) broken) vacuum inside the sea of “false” (SU(2) x U(1) unbroken) vacuum, which 
would eventually expand and fill the entire space. CP violation could occur by interactions 
of the high temperature plasma with the bubble wall, while B + L violation would be due 
to sphalerons outside or near the wall. The needed first-order transition does not occur in 
the standard model for My ~ 125 GeV, but it could in extensions involving larger Higgs 
sectors.‘” For example, those involving SM singlet Higgs fields, such as extensions of the 
MSSM that can generate a dynamical u term, have cubic Higgs interactions at tree-level that 
can easily lead to a strongly first-order transition (e.g., Maniatis, 2010; Ellwanger et al., 
2010). Generic two-doublet models also have suitable parameter regions!? in which the 
cubic interactions are generated by thermal loops (e.g., Cline, 2006; Dorsch et al., 2013). 
The extended Higgs sectors could also provide the needed new sources of CP violation, 
though some care is required to be consistent with EDM constraints (Section 8.6.5). They 
would likely yield signatures observable at the LHC. 

Other possibilities for the baryon asymmetry include the decay of a coherent scalar field, 
such as a scalar quark or lepton in supersymmetry [the Affleck-Dine mechanism (Affleck and 
Dine, 1985)], or CPT violation (Cohen and Kaplan, 1987; Davoudiasl et al., 2004). Finally, 
one cannot totally dismiss the possibility that the asymmetry is simply due to an initial 
condition on the big bang. However, this possibility disappears if the Universe underwent 
a period of rapid inflation that diluted the asymmetry to essentially zero. 


12This is a major motivation for probes of the Higgs cubic self-interactions at existing and future collid- 
ers (Huang et al., 2016; Arkani-Hamed et al., 2016). 

13This could include the MSSM if one of the scalar top quarks is sufficiently light (Carena et al., 2013), 
but this possibility is in tension which LHC constraints. 
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What is the Dark Energy? 


In recent years a remarkable concordance of cosmological observations involving the cosmic 
microwave background radiation (CMB), acceleration of the Universe as determined by Type 
Ia supernova observations, large scale distribution of galaxies and clusters, and big bang 
nucleosynthesis has allowed precise determinations of the cosmological parameters (Kolb 
and Turner, 1990; Peebles, 1993; Hinshaw et al., 2013; Ade et al., 2016; Patrignani, 2016): 
the Universe is close to flat, with some form of dark energy making up 68-69% of the energy 
density. Dark matter constitutes 26-27%, while ordinary matter (mainly baryons) represents 
only about 5%. The mysterious dark energy (Peebles and Ratra, 2003; Copeland et al., 2006; 
Frieman et al., 2008; Weinberg et al., 2013), which is the most important contribution to 
the energy density and leads to the acceleration of the expansion of the Universe, is not 
accounted for in the SM. It could be due to a cosmological constant that is incredibly tiny on 
the particle physics scale, to a slowly time varying field [quintessence (Zlatev et al., 1999)], 
or possibly to a modification of general relativity (Joyce et al., 2016). Is the acceleration 
somehow related to an earlier and much more dramatic period of inflation? If it is associated 
with a time-varying field, could it be connected with a possible time variation of coupling 
“constants” (e.g., Uzan, 2011)? 


What is the Dark Matter? 


There is abundant evidence from galactic rotation curves, the motion of galaxies in clusters, 
gravitational lensing, galactic mergers, and the CMB that there is much more matter in 
the Universe than can be accounted for in stars, gas, and other known forms. The missing 
matter does not seem to be ordinary baryonic matter that is somehow hidden, both because 
of direct astrophysical searches [e.g., for massive compact halo objects (MACHOs)] and more 
generally because the total baryonic density is determined independently by BBN and the 
CMB. It is therefore presumably some new (dark) form of matter that interacts at most 
weakly with electromagnetism.'4 For a historical introduction to dark matter, see (Bertone 
and Hooper, 2016). For general reviews, see, e.g., (Bertone et al., 2005; Hooper, 2010; 
Strigari, 2013; Klasen et al., 2015; Gelmini, 2015; Baudis, 2016; Lisanti, 2016). 

One possibility is primordial black holes (PBH) that somehow formed prior to BBN. 
There are many astrophysical constraints on the density of PBHs (e.g., Green, 2015), but 
there is a window from around 10713 — 1078 Mọ for which they could constitute the dark 
matter, and a second window around 10 Mọ (the range suggested by the LIGO gravitational 
wave signals from merging black holes (Abbott et al., 2016)) where they might account for 
at least some of it (e.g., Clesse and Garcfa-Bellido, 2016). 

It is likely, though not certain, that the dark matter is associated with elementary 
particles. Candidate particles are usually classified as hot, cold, or warm, depending on 
whether they were relativistic, non-relativistic, or intermediate when they decoupled from 
equilibrium!® in the early universe. It was briefly commented in Section 9.2.2 that hot dark 
matter (light massive neutrinos) is excluded because it does not cluster sufficiently on small 
scales, while keV-scale sterile neutrinos are a viable candidate for warm dark matter. 

The most common cold dark matter (CDM) scenario involves a stable (at least on 


14An alternative possibility is to modify Newtonian dynamics (MOND) or general relativity on large 
scales (Bekenstein and Milgrom, 1984; Famaey and McGaugh, 2012). This works reasonably well for galaxies 
but is less successful with larger scales, especially with colliding galaxies, which require some new component 
similar to dark matter. 

15Some types of dark matter, such as PBH or axions, are considered cold if their gravitational clustering 
is similar even though their production mechanisms may be nonthermal. 
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cosmological time scales) neutral colorless particle x, which usually has spin-0 or 1/2 and 
which may or may not be distinct from its antiparticle x. In the thermal dark matter scenario 
x and y are assumed to have equal number densities (if they are distinct), maintained in 
equilibrium number œ T?/2e~™x/T by reactions such as yy + o or xX% © OG, where o 
may be a lighter SM particle, until they freeze out at some temperature Ty. A detailed 
analysis shows that 
3 x 107?" cm?/s 
(ov) ' 
where Q, is the dark matter density in x relative to the critical density, h ~ 0.68 is the 
Hubble parameter H at present in units of 100 km s7! Mpc7', and (cv) is the thermal 
average of the annihilation cross section times relative velocity. The form of (10.6) is easily 
understood: a larger annihilation cross section means that the freezeout occurs later (at a 
larger M,/T,) when the number density is lower. Assuming that Q, ~ 0.26 (the observed 
dark matter density), one requires 


Qh? ~ (10.6) 


(ov) ~ 3 x 10-9 GeV’. (10.7) 


But for S-wave annihilation via a lighter mediator X (such as H or Z) one expects (av) ~ 
a% /Mz, where ax = gx /4m and gx is the relevant coupling. In particular, particles in 
the general range of 100 — 1000 GeV will automatically lead to dark matter densities via 
thermal production in the observed ballpark for weak interaction strength couplings, e.g., 
ax ~ 107? for My ~ 200 GeV. This is referred to as the WIMP miracle, and such weakly 
interacting massive particles are known as WIMPs. 

WIMPs candidates include the lightest supersymmetric partner (usually a neutralino) 
in supersymmetric models with R-parity conservation (Section 10.2.5), or analogous stable 
particles in Little Higgs or universal extra dimension models. As we will see, however, 
experimental constraints on these conventional candidates are significantly reducing the 
allowed parameter space. This has in part been the motivation for considerable attention to 
the possibility of much lighter dark matter particles, perhaps associated with some dark or 
hidden sector (e.g., Alexander et al., 2016) with very weak coupling to ordinary particles. 
Such sectors, which occur in some superstring constructions, may be connected to the 
ordinary sector by kinetic mixing of the Z with a new Z’ gauge boson (Section 10.3.1), or 
by other types of portals involving Higgs bosons, heavy Z’, neutrinos, axions, etc. 

There are many direct, indirect, and collider searches for WIMPs and similar particles 
(see the general reviews, as well as (Jungman et al., 1996; Cushman et al., 2013) and the 
Dark Matter review in (Patrignani, 2016)). The direct experiments look for the nuclear 
recoil from an elastic (or possibly inelastic) scattering xN — xN. The experiments are 
carried out on nuclear targets, and can search for spin-independent (SI) and spin-dependent 
(SD) scattering processes. SI scattering acts coherently on all of the nucleons and therefore 
scales as A?. The SD case, which is proportional in amplitude to the net spin, is much less 
constrained. SI scattering is associated with interactions such as ¥yNN or xy“ xN Ya N (for 
spin-1/2), which could be, respectively, generated by t-channel H or Z exchange. The latter 
would require a Dirac x. For a Majorana x, such as neutralinos in the conventional MSSM, 
Higgs exchange would still lead to SI scattering, but Z exchange would involve the operator 
XV xN N, which only contributes to SD scattering. (These statements are easily seen 
by using the Pauli-Dirac representation for the y matrices and spinors (Section 2.7.2) in the 
non-relativistic limit, and the fact that a Majorana spinor has no vector coupling to the Z 
(Problem 9.4.)) None of the experiments has observed the direct scattering of WIMPs. There 
have been some possible positive signals, especially from the DAMA/LIBRA experiment 
based on annual modulation (Bernabei et al., 2010), but these are difficult to reconcile with 
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other exclusions within the canonical WIMP scenario. The limits on the SI cross sections 
per nucleon’® are shown in Figure 10.2. Much but not all of the cross section region for 
heavy (100—1000 GeV) WIMPs in the MSSM is excluded. For lighter WIMPs the limits are 
approaching the neutrino floor, i.e., the cross sections for which the rates are comparable 
to backgrounds from solar, atmospheric, or diffuse supernova neutrino scattering. Going 
further would require some sort of directional sensitivity (e.g., Grothaus et al., 2014). 
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Figure 10.2 Limits on spin-independent cross sections for elastic WIMP-nucleon scat- 
tering vs. WIMP mass, assuming a theoretical model for the local density and ve- 
locity distribution consistent with Q, ~ 0.26. The enclosed regions are reported 
positive signals. The low mass part of the neutrino floor and the part of the MSSM 
parameter space that is still allowed are also shown. Plot courtesy of the Particle 
Data Group. 


Indirect detection refers to searches for y’s, v’s, e*’s, p’s, etc. that result directly from 
the annihilation or possibly decays of dark matter particles in regions of high concentration, 
or from the decays or secondary interactions of the directly produced particles. For example, 
neutrino signals could be due to WIMPs concentrated in the Sun or center of the Earth, 
while those near the galactic center or in other galaxies could lead to monoenergetic photons 
from direct annihilation, or to continuum photons or antiparticles from secondary processes. 
Indirect detection is sensitive to a wide mass range from small to multi-TeV. There have 
been various hints of signals involving photons or excesses of antiparticles (see Gaskins, 


16The experiments on different nuclear targets can be directly compared assuming that the scattering is 
isospin-independent, i.e., the same for p and n. Isospin-dependent scattering is less constrained. 


Beyond the Standard Model 437 


2016, for a recent review), but because of possible astrophysical backgrounds none have so 
far proved compelling. 


ATLAS and CMS have searched for dark matter particles via pp > VX + X, where 
X contains an energetic monojet, y, Z, H, etc, typically radiated from an initial parton. 
The Y's are not directly observed, so the signature is X plus a large missing transverse 
momentum. The sensitivities are comparable to those of the direct searches for both SI and 
SD operators.!” So far, no positive signals have been observed. 

There are many variations on the thermal WIMP scenario. For example, equilibrium 
could be associated with more complicated processes, such as resonant annihilation (with 
2M, close to the mass of an s-channel resonance), coannihilation between x and some 
other particle (presumably similar in mass), or in reactions other than 2 + 2. The WIMPs 
could also have strong interactions amongst themselves (SIMPs), there could be multiple 
dark matter components, or they could scatter inelastically to excited states. Increased 
numbers of WIMPs could be produced by non-thermal mechanisms, such as the decay of 
a coherent field, or in asymmetric scenarios, in which there is somehow an asymmetry 
between the numbers of x and ¥ analogous to the baryon asymmetry. Another possibility 
(SuperWIMPs) (e.g., Feng, 2010) is that the dark matter particles are extremely weakly 
interacting, and are produced by the late decays of WIMPs. An example are light gravitinos, 
expected in some versions of supersymmetry with a low breaking scale. 

Invisible axions are a very different possibility for cold dark matter. They were originally 
introduced in connection with the strong CP problem, but axions and axion-like particles 
can occur more generallly. (See the discussion of the strong CP problem in Section 10.1.1.) 
For masses in the 1075 — 1074 eV range they would be viable CDM candidates, produced 
non-thermally by the decay of a coherent field. The details depend sensitively on the mass 
and scale, and on whether they were produced before or after a period of inflation (for 
reviews, see Kawasaki and Nakayama, 2013; Marsh, 2016). 

The cold dark matter scenario is quite successful at describing large-scale structures, 
but is less so on scales smaller than our galaxy. Most CDM simulations suggest that there 
should be more dwarf and satellite galaxies and a steeper central density profile for dwarf 
galaxies than observed. These discrepancies may be due to inadequate modelling of the 
astrophysics, but they could also be hints that the CDM picture needs to be modified. 
Possible candidates include warm dark matter associated with keV-scale sterile neutrinos 
(Section 9.2.2). Another is self-interacting (SIMP) dark matter. We finally mention the 
interesting suggestion of fuzzy dark matter (Hu et al., 2000; Hui et al., 2017), which consists 
of ultralight, O(10~?? eV), scalars produced by the decay of a coherent field (similar to 
axions). These would have kpc-scale de Broglie wavelengths, with the small-scale dynamics 
controlled by its wave nature. 


The Suppression of Flavor Changing Neutral Currents, Proton Decay, and Electric Dipole 
Moments 


As discussed in Sections 8.6.5 and 8.6.6, the standard model has a number of (approximate) 
accidental symmetries and features that forbid proton decay at the perturbative level, pre- 
serve lepton number and lepton flavor (at least for vanishing neutrino masses), suppress 
transitions such as K* 4 mtvv at tree-level, and lead to highly suppressed electric dipole 


17Direct, indirect (for annihilations into quarks), and collider processes can be loosely thought of as 


i (=) = = (>) ; . 
crossed channel reactions, e.g., Xq > xq, X X — qq, and gg —> x x , respectively. However, comparisons are 
complicated by the very different kinematic regimes. 
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moments for the e~, n, atoms, etc. However, most extensions of the SM have new interac- 
tions that violate such symmetries, leading to potentially serious problems with FCNC and 
EDMs. There is a tradeoff/conflict between these constraints, which favor high mass scales 
for the new physics, and naturalness, which favors lower scales. 


A Mechanism for Small Neutrino Masses 


The most popular possibility is the minimal seesaw model, implying Majorana masses, 
but there are other plausible mechanisms for either small Dirac or Majorana masses, as 
described in Chapter 9. 


Possible Types of New Physics 


There is an enormous range of possibilities for new physics beyond the standard model (e.g., 
Eichten et al., 1984; Morrissey et al., 2012; Nath et al., 2010; Alves et al., 2012; Gershtein 
et al., 2013), many of which were mentioned above or in Chapter 8. Many types are “bottom- 
up,” i.e., motivated by attempts to resolve problems of the standard model, to understand 
the dark matter, or to explain possible experimental anomalies. These include supersym- 
metry; extended Higgs sectors; family symmetries; extended TeV-scale gauge groups; new 
types of particles, such as leptoquarks or diquarks; large and/or warped extra dimensions; 
dark or hidden sectors; and strong coupling theories such as dynamical symmetry break- 
ing, composite Higgs models, or composite and excited fermions!® (Harari, 1984). Other 
“top-down” types, such as grand unification and superstring theories, attempt to achieve 
a more fundamental unified understanding of the microscopic interactions, to incorporate 
gravity, to describe the dark energy, or to make contact with ideas concerning the early 
universe. Bottom-up and top-down ideas are not mutually exclusive, and ideas such as su- 
persymmetry can be motivated from either. Additional or exotic heavy fermions, additional 
Higgs particles, new types of interactions, hidden sector particles, etc., may also arise as 
remnants (e.g., Barger et al., 2007), i.e., particles emerging from an underlying theory char- 
acterized by a much larger scale that remain light by some accident in the dynamics or 
symmetry breaking. This often occurs, for example, in superstring vacua in which most but 
not all degrees of freedom achieve Planck scale masses. Such remnants are quite plausible 
from a top-down perspective, though they may appear superfluous and non-minimal from 
a bottom-up view. 

In the following we describe three important examples of possible new physics: super- 
symmetry, extended (TeV-scale) gauge symmetries, and grand unification. 


10.2 SUPERSYMMETRY 


Supersymmetry is a hypothetical symmetry very different from those we have so far encoun- 
tered, viz., between bosons and fermions. It is especially motivated by attempts to unify 
gravity with the other interactions, especially in supergravity and superstring theories, 
though that does not by itself imply that supersymmetry is relevant at the TeV scale. How- 
ever, Supersymmetry broken at the electroweak-TeV scale is one of the leading contenders 
for solving the Higgs/hierarchy problem discussed in Section 10.1. Other motivations in- 
clude gauge coupling unification, which is much more successful in the supersymmetric 


18Theories of composite quarks and leptons should involve new flavor-diagonal four-fermion interactions 
suppressed by the compositeness scale (Eichten et al., 1983), e.g., by the binding forces or by constituent 
interchange, as well as excited states. Experimental searches (Patrignani, 2016) indicate that the scales 
would have to be in the multi-TeV range, e.g., with the bound state masses protected by a chiral symmetry. 
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extension than in the SM, and the existence of natural candidates for cold dark matter in 
some versions. The possible anomaly in the anomalous magnetic moment of the uw* could 
also be accounted for by relatively light supersymmetric partners, as mentioned in Section 
2.12.3. No sign of supersymmetry has emerged from the LHC as of mid-2016, suggesting 
that if supersymmetry exists at all the mass scale for its breaking must be higher than early 
expectations. Nevertheless, the motivations are sufficiently strong to continue the search. 

The technology and phenomenology of supersymmetry are extremely complicated, and 
here we can only give a brief introduction. These topics are thoroughly described in a number 
of articles (Fayet and Ferrara, 1977; Nilles, 1984; Haber and Kane, 1985; Lykken, 1996a; 
Martin, 1997; Peskin, 2008; Patrignani, 2016) and books (e.g., Wess and Bagger, 1992; 
Weinberg, 2000; Polonsky, 2001; Drees et al., 2004; Baer and Tata, 2006; Binétruy, 2006; 
Terning, 2006; Kane, 2010; Dine, 2015). The original idea of a symmetry between bosons and 
fermions involved the two-dimensional worldsheet of a string theory (Ramond, 1971; Neveu 
and Schwarz, 1971; Gervais and Sakita, 1971). Supersymmetry for a four-dimensional field 
theory was introduced in (Wess and Zumino, 1974a,b; Salam and Strathdee, 1974), and 
applied to the SU(2) x U(1) model in (Fayet, 1975). The history of the development of 
supersymmetry is described in detail in (Weinberg, 2000; Wess, 2009). 


10.2.1 Implications of Supersymmetry 
Corrections to the Higgs Mass 


As described in Section 10.1 the SM leads to troubling quadratically-divergent contributions 
to Mj,. For example, the top quark contribution (the first diagram in Figure 10.3), implies 
a radiative correction!® 


—i (AM}), = —N.(—ih yf ak Tr : (10.8) 
ArT : í (27)4 K-m K-m J’ 
where —1 is due to the closed fermion loop, Ne = 3 counts the t-quark colors, and the 
vertices follow from (8.46) on page 267 with hy = m/v. The quadratically divergent part 
can easily be calculated by applying a Wick rotation, as in (E.14), to yield 


 ik3.dkp dQ, —k? +m? Ne(he)2 
(AMÈ), = —4iNc(ht)? ikpdkp dQa BM, c(h) 
~ o (nf 


A A2 
Eim Te 


(10.9) 


since f dQ4 = 27°. 
Now, suppose there are two complex color-triplet scalar fields ¢,,r = 1,2, with mass 
and couplings to H given by 


Lo H = ArH? oho, — Kp Holo, — metor. (10.10) 


Each contributes to M7, by the middle and right diagrams in Figure 10.3. The middle 
diagram is quadratically divergent, yielding 


4 Š 
a : : ArNe 42. (10.11) 


~i(AMa),. = —2i,Ne | u eara 7 Mae Ge 


The quadratic divergences cancel if we choose Ay = \2 = (h;)?. This would be a remarkable 


19The diagram yields the t contribution to (H|iAL|H). (We are being careless about external momenta 
since we are considering the divergent part.) But £ = — 4M} H? +---,so the matrix element is —i (AM},),- 
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accident if not enforced by some symmetry. Fortunately, in the supersymmetric extension 
of the standard model it occurs naturally: in that case ¢1,2 are the scalar partners of the tz 
and t$, (i.e., tr), respectively, and the necessary coupling constant relation is enforced by 
the supersymmetry. In fact, in the supersymmetric limit, one also has that x, = 2v(h;)? and 
m? = m?, which implies that the logarithmically divergent contributions to M?, also cancel, 
as shown explicitly in, e.g., (Terning, 2006). This is an example of the non-renormalization 
theorem in the supersymmetric limit. In the presence of soft supersymmetry breaking the 
quadratic divergences continue to cancel, but there are finite contributions to M7, related 
to the supersymmetry breaking scale. (See Equation 10.164.) 

Supersymmetry is therefore successful at solving the part of the hierarchy problem asso- 
ciated with the loop corrections. However, as we will see below it introduces a new tree-level 


hierarchy problem, the u problem. 


tr prk } 2 
H=- g Hee Hof la 
hi WS hi Ar Kr M ; Kr 
-t o" 
tr Pr 


Figure 10.3 t quark and ¢, contributions to M?,. 


The Supersymmetric Spectrum 


Supersymmetry is elegant in its principles but not economical in its particle content: it 
requires a more than doubling of the SM spectrum. Each standard model particle must 
have a superpartner (or sparticle) differing in spin by 1/2 unit, which we will denote with 
a tilde. The sparticles have the same SU (3) x SU (2) x U(1) assignments as their partners, 
and for phenomenological reasons none of the SM bosons or fermions can be each other’s 
partners. In particular, each left- (right)-chiral quark qz (qr) is predicted to have a spin-0 
scalar quark (squark) partner qz (r). Of course, the L and R labels for a spin-0 particle 
are not directly related to spin—they simply mean that the scalar is the partner of the 
corresponding quark. As discussed in Section 2.11, it is convenient to work in terms of the 
left-chiral particles and antiparticles, using the correspondence qf, = C ap (Equation 2.301 
on page 56) and gf = qi. In the supersymmetric limit, the masses of the quarks and 
squarks would be the same and their interactions related in a definite way, as we saw in 
the example of the cancellation of the Higgs quadratic mass divergence. No light squarks 
or other superpartners have been observed, so the supersymmetry must be broken, with 
the sparticles all relatively heavy, e.g., in the TeV range.?° Similarly, the leptons @ must 
have spin-0 slepton partners l, and the gauge bosons must have spin-1/2 gaugino partners, 
such as the gluinos (G), the winos (W), and the bino (B). Extensions to supergravity also 
predict a spin-3/2 partner of the graviton (g), the gravitino (93/2). 

Supersymmetry also requires an extended Higgs sector. We saw in Section 8.1 that a 
single Higgs doublet ¢ could have the needed Yukawa couplings to generate masses for both 
the u and d quarks by making use of the conjugate ¢ defined in (8.14) on page 260 by 


01t is fairly natural for the superpartners to be much heavier than the SM particles, because the former 
can all acquire large masses without the SSB of SU(2) x U(1), while the SM particle masses other than the 
Higgs all require SSB. 
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$ = ir?’ (this tilde does not represent a superpartner). However, supersymmetry does 
not allow the needed ¢ Yukawa couplings, so we must instead introduce two separate Higgs 


doublets ot g 
Pes d — u 
ba=( $9 2 T ). (10.12) 


where ġa is similar to the SM ¢ and has the couplings needed to generate masses for the d 
quarks and charged leptons, while ¢, plays the role of the SM ¢, and can lead to masses 
for the u quarks and Dirac neutrino masses. It is actually more convenient to introduce the 
conjugate doublets 


ht 2 h? te 
ie ( k ) =u,  ha= ( L ) = da, (10.13) 
u d 


which will allow us to write the Higgs Yukawa interactions in a manifestly supersymmetric 
form. In any case, the minimal supersymmetric standard model (MSSM) involves 3 neutral 
and one conjugate pair of charged Higgs particles (not including the Goldstone bosons), as 
opposed to the single Higgs scalar of the SM. 

Supersymmetry does not allow the quartic Higgs self-interaction \(¢')? of the SM. 
Rather, its role is played by (known) gauge couplings. This removes one of the most arbitrary 
aspects of the SM, and leads to a theoretical upper bound (at tree level) on the mass of the 
lightest neutral mass eigenstate h°, 


M?. < cos” 26 MZ, (10.14) 


where tan 8 = |(0|h°|0)|/|(0|29)0)| is the ratio of neutral Higgs VEVs. Equation 10.14 is 
inconsistent with My = 125 GeV (assuming, as is likely, that the observed H corresponds 
to h?). However, there are large radiative corrections to the effective potential associated 
with top and scalar top loops that increase the upper bound to ~ 135 GeV. 

The two Higgs doublets must also have spin- partners (Higgsinos). (If there were only 
one Higgs doublet the Higgsinos would introduce triangle anomalies, providing another 
rationale for the second doublet.) The Higgsinos can mix with the winos and bino to pro- 
duce two mass eigenstate Dirac charginos (X*, r = 1,2), and 4 mass eigenstate Majorana 
neutralinos (X°, r =1---4). (These are sometimes denoted Ĉ and N° instead.) 

The MSSM particles are listed in Table 10.1. 


Other Implications and Difficulties 
There are a number of other implications of supersymmetry, both good and bad. 


We have not yet observed any superpartners, so supersymmetry must be broken, prob- 
ably with sparticle masses in the TeV range or higher. To avoid the reintroduction 
of the hierarchy problem the breaking should be soft, i.e., appearing only in scalar 
and gaugino mass terms and in cubic scalar couplings. However, general soft breaking 
terms introduce an enormous number of free parameters.”! Moreover, there are mass 
sum rule constraints that would be violated for spontaneous or dynamical breaking 
that occurs directly in the ordinary sector associated with the MSSM particles. Super- 
symmetry breaking therefore most likely occurs in some hidden sector which is only 
very weakly coupled to the ordinary sector. How this breaking occurs (e.g., Intrili- 
gator and Seiberg, 2007) and how the information is transmitted to our sector (the 
mediation mechanism) (e.g., Chung et al., 2005) introduce considerable uncertainties. 


21The MSSM with R-parity conserved has 124 free parameters (Dimopoulos and Sutter, 1995; Patrignani, 
2016), not including neutrino masses and mixings, right-handed scalar neutrinos, Mp, or Acosm. 


442 


The Standard Model and Beyond 


TABLE 10.1 Standard model particles and their supersymmetric partners.® 


: UL sat Fa ÙL so Se hy hy 
ono (g) aa (g) aa Gi) Ge) 
> td ht ho 
spin-4 ea us. dS ee ve GWiB (a) i) 
u d 


xo xt 
spin-1 G WB 
WE ZY 
spin- 93/2 
spin-2 g 


“Family indices and mixing are ignored for the fermions and their scalar partners. The right-chiral 
fermions and their partners are related by Yr = CY°T and wr = pi. The gauge and Higgs particles are 
listed in the weak basis. The graviton g and its partner the gravitino, g3/2, are also listed. 


Many supersymmetric models involve or impose a discrete R-parity symmetry (Farrar 
and Fayet, 1978), Rp, which requires that every allowed interaction vertex involves an 
even number of superpartners. This implies that the lightest superpartner (the LSP) 
is absolutely stable, and therefore a candidate to be dark matter. Neutralinos are 
the most promising possibility, although scalar neutrinos or the gravitino are a priori 
possible. 


In the decoupling limit, in which the superpartners and extra Higgs fields are all 
much heavier than the electroweak scale, the contributions of the new particles to 
electroweak precision observables is small, consistent with the excellent agreement 
with the SM predictions. Also, the lightest neutral Higgs (h°) acts very much like the 
SM Higgs in this limit. 


On the other hand, the sparticles lead to possible new sources of FCNC and of CP 
violation, e.g., in the K and B systems and in EDMs. The non-observation of such 
effects suggests that sparticles should be very heavy and/or that sparticles of a given 
type are nearly degenerate and/or that there is some kind of alignment between the 
mixing effects in the quark and scalar quark sectors. 


The combination of gauge invariance and supersymmetry does not allow masses for 
any of the MSSM particles (prior to the SU(2) x U(1) and supersymmetry breaking), 
with the exception of the Higgs scalars and their Higgsino partners, which are allowed 
to have a common arbitrary mass u. If the supersymmetry derives from an underlying 
string theory one might expect to be comparable to the Planck or string scales, 
or possibly 0 if it is forbidden by some extra symmetry. However, neither 0 nor a 
very large value for p is allowed phenomenologically. The u problem (Kim and Nilles, 
1984), which is a tree-level form of the Higgs/hierarchy problem, is to understand 
why u should be nonzero and comparable to the soft supersymmetry breaking scale. 
Possible explanations are discussed in Section 10.2.6. 


The Higgs soft mass parameters are typical soft supersymmetry breaking parameters. 
Assuming that the problem is somehow solved, the scale of electroweak symmetry 
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breaking is therefore tied to the supersymmetry breaking scale, up to an order of 
magnitude or so. (A larger splitting reintroduces a more moderate version of the 
hierarchy problem, i.e., the little hierarchy problem.) 


Many, but not all, supersymmetry breaking and mediation mechanisms imply that the 
Higgs and other scalar mass-squares are positive at some large scale, such as at Mp. 
However, electroweak symmetry breaking is most easily accomplished for a negative 
hy, Mass-square Mi» though this is not absolutely essential. These constraints can 
be reconciled by the fact that the soft mass-squares are running quantities, just like 
coupling constants (e.g., Martin, 1997). A positive m? at a high scale can be driven 
negative at a low scale by a large top-Yukawa coupling h+. This radiative breaking 
mechanism (e.g., Ibáñez and Ross, 2007) therefore requires a heavy top quark mass. 


10.2.2 Formalism 


In this section we summarize some of the formalism used in the construction of supersym- 
metric field theories. 


The Lorentz and Poincare Groups 


Let us briefly survey the Lorentz and Poincaré groups, which describe the classical spacetime 
symmetries. A Lorentz transformation can be defined by its action on four-vectors, 


V 


1 v $ =1 
choo = Ab ao”, Eu > £u =o, (A Vn 


(10.15) 
with Oy. = A,’. The group of such transformations is known as SO(1, 3), i.e., it leaves 
invariant £ y, = guy xy”, where guy = diag(1, —1,—1,—1). A can be expressed in terms of 
its generators’ (i.e., elements of the Lie algebra) Mpo by 


A = e280 Moo (10.16) 


where the 6 group parameters w’ are real and antisymmetric in the indices, as are the 
6 generators Mpo. The group action and commutation rules can be obtained using the 
classical representation 

Mpo = LpPo — ToPp, (10.17) 


where pz = +70, is the position space representation of the momentum operator P,. For 
small w?” one finds 
gh ~ (54 — wt r”. (10.18) 


V 


It is useful to define the group parameters 


w= 3e Wik C= Ww, i,j: = 1,253; (10.19) 
and generators 
i 1 ijk > =) 4 3 Oi (0 D 3 iO 
Fa SeMMy = (Exp), Ki = M” = opi — s'h, (10.20) 
22These elements actually represent the proper orthochronous Lorentz group, with det A = +1 and 


A9) > +1. It must be supplemented with space reflection and time reversal to obtain the full Lorentz group. 
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so that 


1 z 
zo Mo =0: J+ R. (10.21) 


J’ are the rotation generators (angular momenta), and KŻ generate Lorentz boosts in the i 
direction. The rapidity Ç, defined in Problem 1.4, is related to the velocity 6 of the Lorentz 
boost by ¢ = 6 tanh~’ 8, which is approximated by 8 for small 8. Then 


a ~ a — Big’, ging — (0 x TY — pr, (10.22) 
which indeed represent? an infinitesimal rotation by @ and boost by 8. 
One can extend to the Poincaré group by including the translations 

a! = Tr = et Po y= g-a, (10.23) 


with a field (x) transforming as in (1.14) on page 3, i.e., 


P(x) > P (x) = et Po h(x) et Pe = (x + a) = (Ttr). (10.24) 
The Lie algebra of the Poincaré group can be derived from [£p, Po] = —igpo; 
[Pus P,] =0, [Myv, P,] = —i(GupPr — gupPu) (10.25) 
[My Mpo] = i(JupMpo — GvoMup — IupMvo + Juo Mvp), 
and correspondingly 
[JŻ, Jİ] Steg”, [KŻ, JÍ] = ici K", [K*, Kİ] = Seg”. (10.26) 
This implies that Ji, = $[J' +iK'] form a commuting SU(2)z x SU(2)r algebra, 
a js o= (10.27) 


which means that the compact representations of the Lorentz group can be labeled by 


(j+,j-), where j+ = 0, 3,1, --+ are defined by the Casimirs J? = j4(j+ + 1) and J2 = 


j-(j- +1). A field (x) transforming under the (j4, j—) representation goes into 


D(x) > B(x) = e72 Moo B(x) e230” Me = Ag, j _yO(A-12), (10.28) 


jjaja 
under a Lorentz transformation A, where Aç;,,j_) is the representation matrix of e39" Mpo 
(cf. Equations 3.28 and 3.30). For example, the four-dimensional (5, $) representation is just 
the defining four-vector representation, as will be shown in an example below. Similarly, 
the (4,0) and (0, 4) representations”* are two-dimensional, with J = 6/2 and K = Fie/2, 
respectively, 


ig. 226.2 jg3.2 =F. 2 = 
Adige 2s, Aga ae 2s = AG): (10.29) 
They describe the transformations of L- and R-chiral Weyl spinors Yz and Wp, respectively, 
as is shown in Problem 10.2. The conjugate representations 


ak 


sig SHO 1T i= 
Rey DG ye Aa =e 


* * 


iE _ 4 


(3,0) 


are equivalent but useful for constructing Lorentz invariants. 


?3These are passive transformations, in which x’ represents the coordinates of an event in a transformed 
coordinate system. For an active transformation, in which an event is rotated and boosted in a fixed reference 
system, the signs of & and Č must be reversed. 

24These representations define the classical group SL(2,C) of complex 2 x 2 matrices with unit determi- 
nant. It is obvious from this form that the Lorentz group is non-compact, i.e., ¢ can take any real value, 
and Tr (KŻ Ki) = —464 < 0. 
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Spinor Notation for Two-Component Fields 


The notations for L- and R-chiral fermions that we introduced in Chapter 2 can be confusing 
when discussing C and CP transformations, Hermitian conjugation, Majorana masses, or 
independent fields, especially in four-component notation. In supersymmetry one works 
mainly in terms of L-chiral particle and antiparticle fields. It becomes tedious to express 
Dirac or Majorana mass terms or Yukawa couplings using the notation developed so far (e.g., 
the Dirac mass terms in (2.347) on page 63 or the Majorana ones in (9.8) on page 376), so it 
is worthwhile here to introduce a more streamlined version of the two-component notation. 
The key points are that Hermitian conjugation reverses the chirality of a field, and that the 
io? that enters charge conjugation can be viewed as a raising/lowering operator. 

We have used the notation that Yz and W¢ (or Yz and W¢) are left-chiral fields, while 
Hermitian or Dirac conjugated fields, such as Yh = Cy? = CU) (or VQ = io? VF = 
io?(W,)'”), and the corresponding conjugates of Y$, are right-chiral. A more compact 
notation is to write all left-chiral spinors by a symbol such as € (without a bar), and right 
chiral spinors by a symbol with a bar,?° such as 7. (The bar has nothing to do with Dirac 
adjoint or with antiparticle). One can, if desired, refer to a left-chiral antiparticle spinor 
with a superscript?° c. Thus, for example, 


U> Wat, Bean VROT. (10.31) 


It is often useful to display the components. It is conventional to use a lower undotted index 
for an L-spinor and an upper dotted one for an R-spinor, e.g., 


Ura > ba; Ura > Ê, (10.32) 


where a = 1,2. The dotted index simply indicates that it is an R-spinor, and also serves 
as a warning since dotted and undotted indices are never contracted. The dot and bar are 
superfluous since they both indicate R, but both are useful depending on the context. The 
R-spinor is introduced with an upper index for convenience in the construction of Lorentz 
invariants and covariants. Under Lorentz transformations, € and 7 transform as (4,0) and 
(0, 4), respectively, i.e., 


ba Mies, > (MTH) n, (10.33) 


where 


M.f = (Ago): (Hye, = o (10.34) 


with Aco) defined in (10.29). 
We next introduce raised and lowered indices, associated with the conjugate represen- 
tations of the Lorentz group. Let 


eP = ebt = eap, a, B = 1,2, (10.35) 


be an antisymmetric tensor with ¢!? = —e12 = 1, with a similar definition for e*?. Note 
that €°? is just the a component of ic?. Then 


Ee yg = —E™ Epy = OB. (10.36) 


25Some authors use a f or *. 
26¢€¢ and ÑE are not independent of 71 and £t. 
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We define the L and R spinors €% and 74 with raised or lowered indices by 
E” = Pp, Ne = eag. (10.37) 
These transform as ce 0) and (0, D respectively, i.e., 
E PMT tha > MDP (10.38) 
Lorentz invariants can then be formed by contracting two L or two R spinors, 


Eri = Effa = Ezab? = eager Eh Ey = hig = Sok, (10.39) 


which commute since the fields themselves are anticommuting fermions. The shorthand 1&2 
always implies that the first index is upper and the second is lower. Similarly, 


Tite = Mant = Men, (10.40) 


where the convention for the barred spinors is that the first index is lower and the second 
is upper. Another way of saying this is that ¿œ and 7% are interpreted as column vectors, 
while €° and ña are row vectors. From (10.33) and (10.38) both £2 and #172 are Lorentz 
scalars. 

Next, we recall that Hermitian conjugation converts L spinors into R spinors. Therefore, 
(£a)! is a barred spinor with a lower index, and similarly for (nê), 


Ea = (Ea), a= Gy)’, (10.41) 


so that E 
En = (ne). (10.42) 


We can use (2.200) on page 41 to express a four-component Dirac field as 


(B)-G) ()-G) ons 


The mass terms for Y% can therefore be written as 


Uibr = V} Ur = EÑ = Eañ’, YrYL = vitz = NE = N ba 


ne i _ (10.44) 
Ob = Orbe + Prhe = VL Yr + VEU, = & + ng, 
similar to the last form of (2.347) on page 63. For two Dirac fields 41,2, 
Vir ber = Erfe, Virber = mê. (10.45) 
We also introduce the vector bilinear forms 
HONE = Ra o"PEy = (Ean), Eo = got nf = (no), (10.46) 


for arbitrary spinors € and ñ, where we interpret the first (second) index of g” as a dotted 
(undotted) upper index, and the first (second) of o” as undotted (dotted) lower. It is 
straightforward to show 


ghee = ee af. cae = Cay gge””s (10.47) 


Beyond the Standard Model 447 


which imply 


BELE = —€o"R. (10.48) 
The bilinear forms in (10.46) transform as Lorentz four-vectors. For example, 
HOME > iMa ME. (10.49) 
One can show (Problem 10.3) that 
Migt#M = Ata”, (10.50) 


establishing the result. 
Vector currents for a Dirac field can be written 


pir” vet = Ul a" Wo, = folly, = Loa" E 


i : (10.51) 
pir der = Vi po" Var = mone = M20" M. 
The QED Lagrangian density in (2.218) on page 43 in spinor notation is 
= z 1 
L = Eo" [ið p + eAy] E + no” [iðn + eAp] 7 — m(Eq + nE) — gee E (10.52) 


where the electron field is y = (€ 7)". The second term in (10.52) can be rewritten 
as 70" [ið , — eA ] n, corresponding to the second form of the electromagnetic current in 
(2.346), i.e., in terms of Yz and W¢ rather than Yz and Ypg. Similarly, the tensor current 


is 
Piro” dor = Yi, 5” Vir = 28 3, Piro” yor = 2m sé, (10.53) 
where 5#” and s#” are defined in (2.338) on page 61. Other identities can be found in, 
e.g., (Chung et al., 2005; Dreiner et al., 2010). 
The expressions for the C, P, and T transformations can easily be translated into the 
spinor language 


ft e fn 1 2 ‘a 

v=(S)ow-(2) sp) wsi) 
E -f Sx A g al Ex 

tala) (arla) earl): 


up to possible intrinsic phases.?’ It is understood that the appropriate transformations are 
made on the space-time variable x. We see from (10.54) that charge conjugation interchanges 
€ and n, i.e., E° = ņ and 7° = £. As simple examples, the space reflection transformations 
expressed in 4 and 2 component language include 


Pirr P™! = dire 4> P&ipP7* = mé 
PoiryderP | = dire > P&re P = Mmo, 
while charge conjugation leads to 
ChirverC | = Pirr = prir 4> C&O = qé = &m 
Cpr" porC™ = pir" ba, = -pry dir (10.56) 
=> C&O! = mon, = —mo"m. 


(10.54) 


(10.55) 


27 The transformations P, CP, and T raise or lower indices, implying an extra minus sign when acting on 
a spinor with the index in the “wrong” location. For example, Pg PT! = 7%, while PE“ PT! = —ña. There 
is also an extra sign in PYP! because of the opposite intrinsic parity for an antifermion, as discussed 
below (2.332). 
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The Fierz identities in (2.340) and (2.341) become 
(10 £2) (E364) = (&15" £4) (E30 ,E2) = 2(E2€4) (E3é1) 
(mon) (E3Fu€4) = —2(m £4) (E372). 


Finally, a Majorana fermion, such as the triplet Majorana neutrino introduced in (9.6) 
(page 375) and (9.8), can be written as 


w(K )=(Q)> (Bg )=(B)- ons 


so the free field Lagrangian density in (9.7) and (9.8) becomes 


(10.57) 


L = Ži" ð £ — > [EErEE]; (10.59) 


where vm = (£ ET. 


The Supersymmetry Algebra and Representations 


Supersymmetry transformations connect fermions and bosons, and the associated genera- 
tors Q must therefore be fermionic,”* i.e., anticommuting. In analogy with the Wey] fields 
¢ and 7, we will introduce fermionic charges Qa, a = 1,2, and their conjugates Qa = (Qa)!. 


The N = 1 supersymmetry algebra?’ is 
(Qa a} =20!', Por (Qu Pa] = [aso] =0 
{Qa QB} = {Qa Q} =0 (10.60) 
[Qa, Myu] = (Sura Qe, [Q*, My] = (Gur) $P. 


The non-vanishing commutators of the supersymmetry charges with M,,, are an indica- 
tion that supersymmetry connects states of different spin and statistics. Because of the 
antisymmetry, the product of three of more Q’s must vanish 


QaQsQ, = 0, (10.61) 


and similarly for the Q. The indices can be raised and lowered using €°°, just as for the 
Weyl spinors, e.g., Q% = e°? Qg. The first relation in (10.60) implies 


(0|H|0) = TORQ + (Q1) Qı + Q2(Q2)' + (Q2)'Q2|0) > 0, (10.62) 


that is, the ground state energy of a supersymmetric field theory must have nonnegative 
energy. Furthermore, if the supersymmetry is not spontaneously broken, i.e, for Qal0)} = 
(Q.)'|0) = 0, the vacuum energy must vanish, (0|H|0) = 0. In the case of a scalar potential 
V(@), for example, the condition for the spontaneous breaking of the supersymmetry is that 
V(v) > 0 at the minimum. This is to be contrasted with internal symmetries, for which 
the relevant issue is whether v # 0 at the minimum, as illustrated in Figure 10.4. The 
requirement (0|H|0) > 0 can be violated in supergravity (gauged supersymmetry) or in the 
presence of explicit supersymmety breaking terms. 


?8The Coleman-Mandula theorem (Coleman and Mandula, 1967) states that under reasonable assump- 
tions the space-time symmetry of a field theory cannot be extended beyond the Poincaré algebra except for 
internal symmetry generators if the generators obey commutation rules. However, it can be extended by 
the inclusion of fermionic generators (Haag et al., 1975). 

29The algebra can be extended to N fermionic charges Qt,,i=1---N. The cases N = 2,4, and 8 are of 
considerable interest for theoretical discussions, but do not appear to be directly applicable as extensions 
of the SM. 
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Figure 10.4 Left: potentials which preserve internal symmetries. Right: potentials 
which break internal symmetries. In both cases, supersymmetry is preserved for 
V|min = 0 (solid) and spontaneously broken for V| min > 0 (dashed). 


The Qa and Qa acting on a single particle state either annihilate it or create a state with 
spin and helicity differing by 4 unit. The most important massless irreducible multiplets 
for our purposes are: (a) the chiral supermultiplets, which consist of a Weyl fermion and a 
complex scalar.°° Examples are the chiral quark or lepton fields and their partners, or the 
Higgs scalar and its partner. Chiral supermultiplets can be paired to form Dirac fermions. 
(b) The vector supermultiplets, consisting of a massless vector, e.g., with helicity +1, and 
a Weyl fermion partner. These are generally elements of the adjoint representation of a 
gauge group. A vector supermultiplet can be combined with its CP conjugate to form a 
massless vector with helicities +1 and a (massless) Majorana fermion.*! (c) The gravity 
supermultiplet, consisting of the spin-2 graviton with helicity 2 and its spin-3/2 gravitino 
partner. This can again be combined with its CP conjugate involving a graviton with helicity 
= —2, 

The massless chiral and vector supermultiplets are all that are needed for the MSSM. 
There are also massive representations, that are occasionally needed, e.g., for extensions 
of the SM involving an extended gauge symmetry broken at a scale large compared to 
supersymmetry breaking. We will illustrate the most important ones in Section 10.2.3. 
Before constructing the transformations of the supermultiplets and the rules for invariant 
Lagrangian densities, it is convenient to introduce Grassmann variables and superspace.*” 


Grassmann Variables and Superspace 


Supersymmetry transformations mix particles with different spins, and therefore can be 
considered an extension of the rotation and Poincaré groups. It is convenient to extend 
the notation of four-dimensional spacetime with coordinates x" into a larger superspace, 
involving two additional anticommuting (Grassmann) coordinates ĝa, with a = 1,2, and 
their conjugates ĝa = (@.)'. (Such Grassmann variables are similar to the ones introduced in 
field theory for dealing with fermions using functional methods.) The indices can be raised, 
lowered, or contracted using éag, just as for chiral fields € and x, or the supersymmetry 
generators Q and Q. Because of the anticommuting nature, the product of three or more 
6’s vanishes, 0,936. = 0, and 0,8. = 0 (no sum). This will enormously simplify the task of 


30The supermultiplets also involve unphysical (non-propagating) auxiliary fields. 

31t is convenient in this context to combine a Weyl spinor and its conjugate as in (10.58), even if the 
Majorana mass is zero. 

32OQur notations and conventions follow most closely those in (Wess and Bagger, 1992; Drees et al., 2004). 
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constructing general functions of superspace variables by expanding around @ = 0 = 0. Of 
course, E 

8? = 070, = —20102, 6? = ba bt = 20°67. (10.63) 
One can introduce the concepts of differentiation and integration in superspace, as are 


described in the more detailed books and articles. We will only utilize integration, defined 
by 


fo =0, [eat = fap, IEZ =0, (10.64) 
and similarly for dð. Thus, for example 
feat? = er | dB.005 = Gg — 10y = —20%. (10.65) 
It is useful to define 
d0 = — 540° dba, = — 50d, d*6 = d?6d?0, (10.66) 
from which one finds 
few = few =r [ror =i; (10.67) 


with all other integrals vanishing. These forms will be useful as projection operators onto 
the supersymmetric parts of operators. 

The Grassmann variables can also be contracted with and anticommute with spinor 
fields and supersymmetry generators, and can be formed into vectors. Two useful identities 
are 


1 en oie 1 - 
OE OX = 0%Eq 0P Xg = s EX, 0o”0 0o”0 = +59" 0°, (10.68) 


which can be derived by writing out the components and by using the Fierz identity in 
(10.57). 


A Simple Example 


Before we consider the general supersymmetry transformation rules, let us give a simple 
example. Consider the free field theory for a massless complex scalar ¢, a massless left-chiral 
Weyl spinor £, and an auxiliary complex field F, with Lagrangian density 


L = (ð p) "h + i58 E + FF. (10.69) 


F does not represent a physical particle (the Euler-Lagrange equation in this case is just F = 
0), but it is needed to ensure a supersymmetric action. The supersymmetry transformations 
of the fields into each other turn out to be 


E> E+ 66, E = V2€F — iV20"EI,.g (10.70) 
F>F+ôF, ôF =-iV2@5",€, 


where e and € are Grassmann parameters that define the transformation, analogous to a” 
and wP? for Poincaré transformations, or 6f for the internal global symmetries discussed 
in Chapter 3. It is straightforward to prove that £ is invariant under this transformation 
(Problem 10.6). It can also be shown that these transformations indeed form a representation 
of the supersymmetry algebra in (10.72). We note that the shift in F is a total derivative 
(since € is a constant). Therefore f d‘xF(z) is invariant under the transformation. This will 
be very useful in more realistic examples. 
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Superfields 


A superfield is an operator ®(x, 0,0) that is a function of the superspace coordinates x, 0, 
and @. One can expand © in a power series in 6 and 6, but no terms involving more than 
two factors of 0 or @ survive because of the antisymmetry. Consequently, the most general 
Lorentz invariant superfield is 


®(x, 0,0) = (x) + V20E(x) + V20X(x) + 00F (x) + 00G(x) 


- zo = 1 (10.71) 
+ 60"6A,,(x) +000A(x) + BO0K(x) + 3 9008D(2), 
where ¢, F, G, and D are Lorentz-scalar functions of z; €, ¥, k, and \ are Weyl spinors; 
and A,, is a vector. 

We are now ready to interpret the supersymmetry transformations as translations in 
superspace. The algebra in (10.60) can be rewritten in terms of commutators involving 
OQ = 0° Qa and OQ = 64Q", 

[0Q, 0Q] = 200"0 P,, [0Q, P| = [0Q, P] = 0 
[0Q,0Q] = [0Q, 0Q] = 0 (10.72) 
[0Q, Mp] = $v Q, (AQ, Mw] = O5 Q. 


Just as we defined the translation operator exp(+ia: P) in ordinary space in (10.23), we 
can define a supersymmetry translation operator as 


S(a, €, €) = ete te?) (10.73) 


where € and é are Grassmann variables like 8 and @. The multiplication of two group elements 
can be calculated exactly using the Baker-Campbell-Hausdorff construction in (3.6) because 
all of the commutators after the first vanish, with the result 


S(a",€,€)S(b", 6,6) = S(a" + bY + ieo"5 — iðo!E, e +ô, €+ 8). (10.74) 
The transformation of a superfield 6(x,0,) can then be found in analogy to (10.28), 
> P' = SPS! ~ O+ileQ+eQ+a-P, S], (10.75) 


where the last form assumes infinitesimal translation parameters. The action of the gener- 
ators can be found by casting them in the form of covariant derivatives with respect to the 
Grassmann variables. The derivation is carried out in the standard references, but here we 
will just give the result, 


D(x”, 0,0) > O(a", 0,0) = O(a +a” + ieo”0 — i00"E,0+6€,0+8). (10.76) 


A general superfield such as that in (10.71) is highly reducible, but special cases are 
irreducible and close under the transformations. We are most interested in left-chiral super- 
fields ®,(x,0,0), which only involve L-chiral spinors; right-chiral superfields ¢p (such as 
(®,)'): and vector superfields, which satisfy V = VÝ (such as dlor or ®; + ð$). A left- 
chiral superfield and its associated left-chiral supermultiplet involve two complex scalars, 
(x) and an auxiliary field F(x), and an L-spinor E(x), with specific relations between the 
components. It can be written 


®,(x,0,0) = olx) + V20E(x) + 00F (x) — i0o bð plc) 


0000" g(a ng RN 
= pale) + (Op€(a)) "8. 
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The supersymmetry transformation of ®; can be obtained from (10.76) (with a = 0), noting 
that, e.g., E 7 
O(a" + ieo"6 — 100%) ~ G(x) + ileo”0 — 0o"E)ð olr), (10.78) 


to obtain 


6b = Vet 
“Na” 
0€(x) 


ÔE = V2eF — qe Oud Fyre Ou = VF — iV 20%€ Oud 
Fa) a ~a (10.79) 
(x) 0orðð plx) 


2 a Filner = ~i/28"0,€, 


aga) 90(OnE(@))ovO 


ÖF = EO" One + 


where the origin of each term is indicated and we have used (10.68). This result reproduces 
the transformations in (10.70). ®; can be written in a more compact form by introducing 
the variable 

y! = x” — ido"6, (10.80) 


to obtain 
®,(x, 0,0) = Lly, 0) = ly) + V20E(y) + OOF (y), (10.81) 


which depends explicitly only on 0. Equation (10.81) is easily verified by expanding the 
fields and using (10.68). 

From now on we will drop the subscript L on a left-chiral superfield. Some important 
aspects to (re)emphasize are: (a) a chiral superfield transforms as an IRREP of the super- 
symmetry algebra. (b) The (auxiliary) F-component of a chiral superfield transforms as a 
total derivative, so that f d‘xF (2) is an invariant. (c) Superfields commute, ©; 02 = 624, 
because each term has an even number of Grassmann variables and spinors. (d) The product 
$P or sum ®, + ə of two chiral superfields is also a chiral superfield. This can be seen 
for the product by multiplying out the expressions for ®, 9, 


P1 = 12 + V20 (E162 + £261) + 00 (G1 Fo + oF — 182), (10.82) 


where all of the fields are functions of y. This clearly has the form of a left-chiral superfield. 
The same is obviously true for the sum. We also record the product of three left chiral 
superfields for later use: 


D1 D203 = bi g¢2b3 + V20 (E1b263 + 20301 + £36162) (10.83) 
+ 00 (b1¢2F3 + b263F1 + p31 F2 — €1€263 — 2361 — €361¢2) . 
The superpotential”? W(®,) is a holomorphic function of left-chiral superfields ®4, i.e., 


it depends only on the ®, and not on their adjoints or on other right-chiral or vector super- 
fields. We will mainly be concerned with superpotentials that are third-order polynomials 


33 An important consequence of supersymmetry is that the ultraviolet divergences are milder than in non- 
supersymmetric theories due to cancellations. One aspect of this is the non-renormalization theorems (Gris- 
aru et al., 1979; Seiberg, 1993), which include the statement that the superpotential is not renormalized 
to any order in perturbation theory. This restricts the form of renormalization group equations for the 
coefficients, since couplings are only renormalized by wave function effects. It also means that if a term is 
absent from the superpotential or very small, it will not be generated by renormalization. 
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of the superfields, but in principle W could be any holomorphic function expressible as a 
power series. It follows from the above that W is itself a left-chiral superfield, and that its F 
component is a supersymmetric invariant of the action. The F component, denoted [W] F, 
is given by 


aw 
WwW 2 ee X == 32 
= = fa We) Faaa, = baby ETE Ws aa aoe) 


=0=0 


where we have used (10.67). The second form is most easily seen from the examples in 
(10.82) and (10.83). 

A vector superfield V = VÝ is another important special case of (10.71), with the 
restrictions 


e=o, =x, FSO, Ap=A}, à=, D=D!, (10.85) 


One can use (10.76) to show that the change in the (auxiliary) D component of a vector 
superfield under a supersymmetry transformation is a total derivative, and therefore D(x) 
in an invariant when integrated over x. From (10.67) and (10.71), 


1 
[Vlp= fov = zP). (10.86) 
The special case V (x, 0,0) = $, [®a(y, o)? ®a(y, 0) leads (Problem 10.8) to 


/ dto XO [Ga(y,9)]' Daly, 0) = X [Oupa] 0 ba + EaG" Our + FLFa], (10.87) 
a a 

which we recognize as the kinetic energy terms for a and a and the auxiliary field term 

from (10.69). Vector superfields are also used to describe gauge bosons and their partners, 

as will be discussed below. 

The supersymmetry transformations associated with S(a, €, €) in (10.73) are global, i.e., 
€, €, and a are constants. They can be promoted to local (gauge) supersymmetry by allowing 
the parameters to be functions of spacetime, just as in an ordinary gauge symmetry. Local 
supersymmetry necessarily implies the existence of gravity, and is therefore known as su- 
pergravity. Supergravity is nonrenormalizable, and would presumably be an effective theory 
below some cutoff scale, e.g., the Planck scale, where it could emerge from an underlying 
superstring theory. 


10.2.3. Supersymmetric Interactions 

Yukawa and Scalar Interactions 

The Lagrangian density for a supersymmetric theory of scalars and fermions (but no gauge 
interactions) can be written 


L = |[K]p + ([W]r + k.c.) = pee K(®, t) + (/ d 0W(®) + he.) , (10.88) 


where W(®) is holomorphic and the Kähler potential K(®, ®t) is Hermitian. We will focus 
on the simplest case** 


K(®, ð) = 5 [®a (y, 0) aly, 9), (10.89) 


a 


34More general forms occur in supergravity and in models of supersymmetry breaking. Also, some authors 
reserve the term Kähler potential for related functions relevant to supergravity. 


454 The Standard Model and Beyond 


which yields the canonical kinetic energy terms in (10.87). The expression for [W] p is given 
in (10.84). The auxiliary fields F, can be eliminated by using the Euler-Lagrange equations 
of motion, 


ow _ ow! 


Fİ = = —-W,(¢), F, = 
0®, 6=6=0 O®, 6=0=0 


= —[W,(4)]". (10.90) 


The terms involving F, therefore lead to the scalar potential V(¢), with 
V(b) =- 9 (Fl Fa + FaWa + FIW!) = $ Fal? = $ Wale). (10.91) 


As expected, V (¢) is non-negative, and the condition for supersymmetry to be unbroken is 
for Wa ($) = 0 at the potential minimum.*? We also define 


= ÆW 
Wal) = Did aaao (10.92) 
so that 
L=LgeE+Ls-—V(¢), (10.93) 
where 
Lene 5 [(3 pa) O pa + ifa" 3pEa] 
E (10.94) 


Ly = -2 E (EokaWeald) + Enë [Was(9)]") 


a, 


are, respectively, the kinetic energy and fermion mass/interaction terms. 


A Majorana Fermion 


As a first example, consider the Wess-Zumino model (Wess and Zumino, 1974a), which 
contains a single left-chiral superfield ®. We choose 


W(®) =ab+ =e # no, (10.95) 
with a, m, and h real. This implies 
L = (nb) + iOH OE — T (EE + EE) — W(EEd + EEG!) — V (9). (10.96) 


The potential is 

V(d) = |a + mọ +h??? , (10.97) 
which has supersymmetry preserving minima at the zeros of a + mọ + hd?. Let us take 
a = 0 and consider the minimum with (¢) = 0, so that 


L=Lre — 5 (66 + EE) — need + Ed") 


(10.98) 
— m? |g)? — hm|gl?(o + 4) — h?|gl?. 


35This is generalized to V = e(¢hoa/Mp) [|Wa|? — 3|W|?/M2] in supergravity with a minimal Kahler 
potential, where Wa = Wa + o,W/M2, and Mp is the Planck mass. The condition for supersymmetry to 
remain unbroken turns out to be Wa = 0 at the minimum, which allows V < 0. 
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We recognize the €€ + €€ term as a Majorana mass term, as in (10.59), so the spectrum 
g 


consists of a Majorana fermion Ym = ( > ) = YL + yf and a complex scalar ¢, both with 


£ 


mass m. The Majorana fermion mass and kinetic energy terms in four-component notation 
are 

= M,- oe -e 1- , m - 

bri pb, — 5 (VLYR + VRUL) = geet Ou — z eM; (10.99) 
as in (9.7) and (9.9). There are Yukawa interactions 


Lyuk = —h (PRL + ULV Rd!) = —h (bu Prbue + bu Prime’), (10.100) 


as well as cubic and quartic interaction terms for ¢, with coefficients related by supersym- 
metry to the Yukawa coupling. It is sometimes useful to write ¢ in terms of Hermitian 
components, ¢ = (S + iP)/./2, where S and P indicate scalar and pseudoscalar. Then, 


v2 (10.101) 


It is interesting to consider the special case m = 0. From (10.98) we see that £ has 
a global U(1) phase symmetry under € — e*%°€, p — e~?"%¢. However, the superfield 
® does not appear to have a simple transformation property. This is an example of an 
R-symmetry (e.g., Intriligator and Seiberg, 2007; Dine, 2015), which does not commute 
with supersymmetry because different components have different charges. One can formally 
express the R-symmetry by assigning a transformation to the Grassmann variable 0, i.e., 


6 8o, d26 ae eh Pp, W => eZ BY, (10.102) 
which implies 


Delto, ged, geie Fe SF, (10109) 


A Dirac Fermion 


As a second example, consider three chiral superfields U, US, and H. The symbols are 
chosen to be suggestive that the fermionic components of U and U® will combine to form a 
Dirac field (i.e., the u quark), while H will play the role of the Higgs, but at this stage they 
are three independent superfields and there are no gauge interactions or chiral symmetries. 
We denote the scalar and spinor components as (i, £u), (ŭ°, Nu), and (h, h), respectively, 
where the use of 7, rather than ¿£ is motivated by (10.54) and the subsequent discussion, 
and the notations i, ã°, and h for the scalars and A for the H spinor are suggestive of the 
MSSM. We choose the superpotential 


W = mUU® + h,UUCH + W(H), (10.104) 


where m and h, are real and W depends only on H. We first take h, = 0 and W = 0, so 
that 
L= LKE =m (Mufu oi Euu) a m? (ŭl? + S) 


7 7 7 i (10.105) 
= Lge- m (ŭürUL + tur) —m? (\&|? + lã?) ; 
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bu 


which corresponds to a Dirac fermion u = ( J ) = ur + ur, as well as two complex 
scalars, u and u°, all with mass m. Including h and W, 
L = Lge + Lu + Ly — Vù, ù, h), (10.106) 
with 
Lu=- m (ues + uhu) = hu (Nuêuh + Euuh?) 
— hu (heui + Eha) — hu (nh + hijuit) 
(10.107) 
=-m (tRUL + ULUR) — hu (ūruLh + ūrurh’) 
= hy (Peuri + ashpa”) = hy (anki + hyupit') 5 
where hz, is the L-chiral Higgsino field corresponding to h, and he is its CP conjugate. 
Similarly, 


1 ~~ 77 
Ly = -5 (MWun (h) +AWun (ht), (10.108) 


and 


V(ŭ, &°, h) = [Mmi + hut h|? + |i + hy Gh|? + |hy titi? + Wy (h) 


2 
| (10.109) 


Nu 
mass h,,(h) if h acquires a VEV (which we assume is real). h is either a massless Weyl 
spinor or acquires a Majorana mass, depending on W (we assume that (ŭ&) = (a°) = 0). 
From (10.107) we see that the Yukawa couplings ūruz,h and ūruprht are accompanied by 
analogous couplings in which one fermion and one scalar are replaced by their superpartners, 
as illustrated in Figure 10.5. 


Again, u = ( Eu ) is a Dirac field with a bare mass m and/or a spontaneously generated 


UR; RA UR, 
k a 

—thy, p--=-- ils e E 
i a 
T = T 


Figure 10.5 Yukawa vertices from (10.107). There are three more diagrams in which 
the incoming and outgoing particles are reversed. 


Abelian Gauge Interactions 


Let us first consider a U (1) gauge symmetry. The gauge boson A,,(x) is a component of a 
vector superfield V, which can be written 


V (a, 0,0) = (x) + V20x(x) + V20X(x) + 00G(x)t + 60G(a) 


_ D _ 1. (10.110) 
+ 0o BA, (2) + 000N(x) + BOAC) + 5 0000D(2), 
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where ¢, A,,, and D are Hermitian. The form of V can be simplified by noting that V + 
iA —iAt is also a vector superfield for any chiral superfield iA. In particular, if ¢, is the 
scalar component of A, then from (10.77) the supergauge transformation 


VoV'=V+iA-iat (10.111) 
generates a new vector superfield with vector component 
Al, =A, + O,(ba + oh), (10.112) 


which is just an ordinary U(1) gauge transformation of the form in (4.9) on page 136. The 
D component [V]p = D(x) is not only supersymmetry invariant (when integrated over x), 
but is invariant under supergauge transformations. 

The other components of A can be used to put V in the form 


— _ = on I e 
V(x,0,0) = 000A (x) + 000A(x) + 000A(x) + 3 9008D (2). (10.113) 
This form, known as the Wess-Zumino gauge (WZ) is not manifestly supersymmetric, i.e., a 
supersymmetry transformation takes one back to the general form. However, it is extremely 
useful because it only involves the relevant physical degrees of freedom, i.e., the gauge boson 
An, its superpartner the gaugino À, and the auxiliary real D field, and is therefore analogous 


to the unitary gauge. One still has the freedom to perform ordinary gauge transformations 
while remaining in the WZ gauge. It is useful to note that in the WZ gauge 


3. dh. .=e 
V?’ (x,0,0) = 30000A" (x) A, (2), Vay S20) (10.114) 
Under a supergauge transformation, a chiral superfield ®, and its conjugate transform as 
Ba e279 AG, BE @t et Pi9GaA (10.115) 


where g is the gauge coupling and qa is the charge of ġa. It is easy to see that the special 


case of an ordinary gauge transformation reproduces (4.9), with 8(x) = —g[@a(x)+¢a(a)']. 

Combining (10.111) and (10.115), we see that ®t e?94aV ©, is supergauge invariant, and that 
Ve — 29qa V 

Ly = pes Yo ptes g, = D Pi e79aV p, > (10.116) 


which generalizes the chiral kinetic energy term in (10.87), is supersymmetric and super- 
gauge invariant. Lg can be written in terms of the component fields by expanding the 
exponential and using (10.114). One obtains 


Ly = Dp [(Daupa) Diba + ifa” Data = V2gqa (Ea Apa F bh Eada) 
a (10.117) 
+99a¢)¢aD + FÌ Fal, 


where Dap is the gauge covariant derivative, i.e., 
Dan Pa = (Op + igqaAu)Qa, Dapa = (On + t9qaAp)&a- (10.118) 


We recognize the first two terms as the gauge covariant kinetic energies for ġa and £a, while 
the third is a related fermion-scalar-gaugino interaction required by the supersymmetry. 
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The construction of the kinetic energy terms for A, and A is rather involved. It is possible 
to construct a supersymmetric derivative of V that contains the field strength F y and a 
derivative of A and that transforms as a chiral supermultiplet but with a spinor index. The 
F term of the contraction of this field with itself°° yields the gauge kinetic terms 


1 z 1 
La = FFF + ia, + 5D. (10.119) 
One can also write a superpotential including any terms that are U(1) invariant, such as 
PaPa P. provided qa +q +4qe = 0. Finally, for the special case of a U(1) gauge symmetry, one 
can use the fact that D(x) is gauge and supersymmetry invariant to add a Fayet-Iliopoulos 
(FI) term (Fayet and Iliopoulos, 1974) 


LFI = KD(x) = 2K|V]p, (10.120) 


where « is a constant. There is no analog of the FI term for a non-abelian gauge symmetry 
or for a U(1) embedded in a non-abelian group. 
The auxiliary field D enters Lg, £4, and £r;. The Euler-Lagrange equation implies 


D(x) = - (s+ > 9¢adh¢a). (10.121) 
Putting everything together, 
1 ~ 1 ane 
L=- gE EF" + Ao" OA — 5 5 (E0&a (4) + Fake Wav(9)") (10.122) 
a,b 


+ JO (Dauhe) Diba + 1&5" Daya — V294a(EaAba + $1LAEa] — V (9), 


where 1 
v(e)=5 F,|? + -|D = Vp + Vp. 10.12 
(9) l | z| | F D (10 3) 


Fa and D are given by (10.90) and (10.121), respectively. 
As an example, consider a single chiral superfield ® with q = 1. There is no gauge 
invariant holomorphic term in ®, so the superpotential vanishes, and 


1 E 2 
L=- FF FH” + ida", A + Q + igA,)o| 


+ i€o"(O, + IgA, )é — V2g(EAd + PAE) — V(9), (10.124) 


where 


V(¢) = tipp? = sl -gpt (10.125) 


For k/g > 0 the minimum of the potential is at ¢ = 0, so the gauge symmetry is unbroken. 
However, V = |D|?/2 = k?/2 4 0 at the minimum, so the supersymmetry is broken. This 
is manifested by the fact that the complex scalar ¢ acquires a mass 


i, =g9D| in = 95, (10.126) 


36More general supersymmetric but non-renormalizable gauge kinetic terms, encountered in supergravity 
and string constructions, involve an additional gauge kinetic function that is holomorphic in the chiral 
superfields. 
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while the Weyl spinor € and the vector supermultiplet remain massless. The massless £, 
known as the Goldstino, is characteristic of a spontaneously broken supersymmetry. It is 
the analog of the Nambu-Goldstone boson of an internal symmetry. 

For «/g < 0 the minimum of the potential will be at (¢) 4 0, so the gauge symmetry 
will be spontaneously broken. Writing « = —gv?/2, where v is real and positive, one can 
choose (¢) = v/V/2 and ¢ = (v + h)/V2 (in unitary gauge), where h is Hermitian. Since 
D = 0 at the minimum, the supersymmetry is preserved. The spectrum consists of one 
massive scalar (h), a massive vector A,,, and a massive Dirac fermion formed by combining 


the fermion from the chiral supermultiplet with the gaugino, Y = ( ‘ | All of these have 


mass gv, i.e., they form a massive vector supermultiplet. 


Non-abelian Gauge Interactions 


Equation (10.122) generalizes in a fairly obvious way to the non-abelian case (which is given 
for a nonsupersymmetric theory in (4.16) on page 140). One finds 


L=- a + iA" Dyp A = : Y (Gok Wa0(d) + Eko Wav(9)") (10.127) 
a,b 


+ |(Da,0)!' Dd + iGo" Dat — VIEN Lipo + PALE] — V(9), 
where 


a. pe 2 412 a tri 
V(6) =Vr+ Dp = > Fa? +5 > ID}, Di = -g Lid. (10.128) 


L', is the group representation matrix for the matter fields (which is the same for ¢ and € 
but which may be reducible). The D term can be written more explicitly as 


Vp = 3 |D? = 5 $ l-9} (L5) do] - (10.129) 
In (10.127) De and D, are, respectively, the covariant derivatives for ® and A. 
Do, = O,I +igA Ls, Dy, = pI +ig Alig, (10.130) 
where (Liaj) jk = —iCijk is the adjoint representation matrix. Thus, 
(DapA)s = ByAs — gear AL Ar, (10.131) 


which should be compared with (4.27). The gaugino-gaugino-gauge interaction can be 
rewritten in four-component notation as 


—igcijr A TH AI A" = —igcijr A y" A] AF = —igcijk AR" AAR 
io o o; n (10.132) 
= 55I Cijk AMY" ALAM, 


with A > Ah = ( a ) = àf + A$$. The vertices for the gauge interactions are shown in 
Figure 10.6. 
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Figure 10.6 The three-point gauge and gaugino vertices corresponding to (10.132), 
(10.127), and (10.133). (There are two ways to contract the gauginos in the first 
diagram.) The notation pq,» is defined in Figure 4.1. The double lines represent the 
Majorana gaugino Aj, = Aj, + Aig. There are additional scalar seagull and gauge 
self interactions, which are shown in Figure 4.1. 


Similarly, the fermion-scalar-gaugino interactions are 
~ V2q[EaX (Lis) ap G6 + OLY (L5) as | 


ae 7 (10.133) 

= —V2g [arà (Li) ap bo + GA‘ (LS). bor l 
where the second form is in four-component notation, with éa —> Yar. Frequently, we 
consider a reducible representation involving pairs of left-chiral superfields ® and ®° whose 
fermionic components will eventually pair to form Dirac fields. Similar to the example 
described above (10.105), we write their components as (¢,€) and (°, n), respectively. 
Allowing for the possibility of a chiral gauge symmetry, we take Le = Lz and Lec = -LẸ 
for their representation matrices, where Lr, and Lp are the representations for the L- and 
R-chiral fermions wr + € and Yr + ñ. The fermion-scalar-gaugino interactions in this case 
become 


— V2g|EaX (Lin) a, Po + GL (Li) gy E — DEX (Lhe) pq Ta — MA! (Lie) 4, O2 
= -V2g [ards (Li) ap Pon + plr Am (Li) ap VoL 
—hpXiu (Lh) pa Var — oră (LR) 5 ban| , (10.134) 


where in the second expression we have introduced the suggestive notation @, > Qaz and 
da > O61 = Gar: 
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10.2.4 Supersymmetry Breaking and Mediation 


Since we do not observe degenerate supermultiplets, supersymmetry (if it is present at all) 
must be broken (for reviews, see Chung et al., 2005; Luty, 2005; Intriligator and Seiberg, 
2007). Just as with an ordinary symmetry, the breaking can be explicit or spontaneous. 
There are a number of reasons that the breaking should be spontaneous, including the 
possibility of a naturally small supersymmetry breaking scale (compared to the Planck 
scale), limiting the number of parameters, suppressing flavor changing effects, and allowing 
a consistent extension to supergravity or superstring theory. Nevertheless, the effective 
theory relevant at low energies may well involve explicit symmetry breaking terms. These 
should be soft (i.e., mass terms and cubic scalar interactions, with dimension < 4), rather 
than hard to avoid the reintroduction of the Higgs hierarchy problem. 

Spontaneous breaking occurs when the F and D terms cannot all vanish simultaneously. 
We already saw one example of D-term breaking in the abelian gauge model in (10.124) 
involving a single charged superfield. For «/g > 0 the D term could not vanish, so the 
complex scalar acquired mass while the Weyl fermion (the Goldstino) remained massless. 
It is also possible to construct F-term breaking (O’Raifeartaigh) models, in which one or 
more of the |F,| is nonzero (O’Raifeartaigh, 1975). Consider three chiral superfields with 
superpotential 

W = m®203 + h® (03 — p’), (10.135) 


where for simplicity we will take m, h, and u? to be real and positive. Then 
FR =—-h(¢3-u?), Fi = -md¢d3, F% = —md2 — 2hdids, (10.136) 


which obviously cannot all vanish simultaneously, so Vp > 0 at the minimum. For example, 
for m? > 2h?y? the minimum occurs for (¢2) = (¢3) = 0, with (¢1) undetermined (i.e., 
there is a flat direction at tree level), for which 


Vr = |F|? = h2p4 > 0. (10.137) 


Quantizing around the minimum (taking (¢)) = 0) yields scalar mass-squares 


m? = 0, m2 =m’, map =m — h?p’, me, =m? +h? R, (10.138) 
where $3r,; are the Hermitian components of @3, i.e., 3 = (¢3r + i¢3r)/V2. The fermion 
mass terms are from 

Ly = —M&2&3 + h.c. + Yukawa terms, (10.139) 


so that €2 and €3 combine to form a Dirac fermion with mass m, while €; remains massless. 
Similar to the D-term breaking example, é is the massless Goldstino associated with the 
spontaneous supersymmetry breaking. As should be intuitive, it is associated with the 
supermultiplet with the nonzero F term. 

The existence of the Goldstino is a generic feature of spontaneous breaking. In supergrav- 
ity theories it is “eaten” to become the helicity +4 components of the massive gravitino 
(the super-Higgs mechanism). These issues are discussed in detail in the more extensive 
treatments listed in the bibliography. 

Tree-level spontaneous supersymmetry breaking implies sum rules relating the mass- 
squares of the fermionic and bosonic degrees of freedom. For example, the sum of the scalar 
mass-squares for the O’Raifeartaigh model in (10.135), 


mh = 2m] + 2m + mp + M3r = 4m’, (10.140) 
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is the same as that for the fermions, where in each case one must weight by the number of 
degrees of freedom (2 for a complex scalar, 4 for a Dirac fermion). There is a similar sum 
rule for the D term example in in (10.124), which however, involves a contribution from 
|D|. For this reason such tree-level breaking is not a phenomenologically viable option for 
the MSSM, because the sum rules would require that some of the superpartners are light, 
contrary to observational limits. 

These constraints can be evaded if the breaking is radiative, i.e., associated with loop 
effects, or if it occurs in a hidden sector that is only weakly coupled to the SM particles. The 
hidden sector models are especially promising, e.g., because they may allow for suppressed 
flavor changing effects (although FCNC are still problematic in some cases). Breaking could 
occur in the hidden sector by tree level F or D mechanisms, or the breaking could be 
dynamical,*" i.e., associated with some strong dynamics in the hidden sector (e.g., Witten, 
1981, 1982; Affleck et al., 1985). The latter possibility could explain why the supersymmetry 
breaking scale is small compared to the Planck scale. For example, an asymptotically free 
gauge group that is weakly coupled at the Planck scale might become strong at some 
intermediate scale As < Mp, just as QCD becomes strong at Agcp, leading to an effective 
F term and a gravitino mass (in supergravity) m3/2 ~ F/Mp. The breaking could be 
associated, for example, with a gaugino condensate, 


(A AI) =O ANG, (10.141) 


analogous to the dynamical breaking of the chiral SU (3) z x SU (3) r flavor symmetry of QCD 
or with some dynamical alternatives to the Higgs mechanism for electroweak breaking (Hill 
and Simmons, 2003). In this case, one expects F ~ A} /Mp. 

The supersymmetry breaking in the hidden sector must somehow be communicated to 
the MSSM particles, leading to effective soft masses for the gauginos, Higgsinos, squarks, 
and sleptons, as well as soft cubic scalar couplings, with a typical scale Mmsoft. This should be 
in the 100 GeV-few TeV range if supersymmetry is relevant to the Higgs/hierarchy problem. 
There are a number of possibilities for the mediation mechanism. One is supergravity medi- 
ation, in which the sectors are connected by higher-dimensional operators with coefficients 
suppressed by inverse powers of Mp. One usually finds Msoft ~ F/Mp or Msoft ~ A3/MZ, 
suggesting VF ~ 10!! GeV. Simple versions of supergravity mediation may lead, e.g, to 
squark masses that are family universal at the Planck or GUT scale. However, RGE effects 
lead to splitting at lower energies, with possible difficulties for FCNC. The gravitino mass 
in supergravity theories is comparable to Msoft- 

In the original gauge mediation models (e.g., Giudice and Rattazzi, 1999), the informa- 
tion about supersymmetry breaking is transmitted by messenger fields that interact with the 
hidden sector and also are charged under the SM gauge group. The effective soft breaking 
(due to loop effects) leads to 

Mesoft ~ Ar (10.142) 
where M is the mass of the messenger particles. For VF ~ M one finds the much lower 
scale VF ~ 104 — 10° GeV, implying a very light gravitino. Since F is low and the SM 
gauge interactions are family universal, gauge mediation is much less problematic for flavor 
changing effects. Gauge mediation is treated generally in (Meade et al., 2009) and reviewed 
in (Kitano et al., 2010). 


There are many other possibilities, including anomaly mediation, gaugino mediation, 


871t is likely that such a supersymmetry breaking vacuum would be metastable, with a nearby supersym- 
metric minimum (Intriligator et al., 2006). 
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radion mediation, and D-term mediation, as reviewed in (Chung et al., 2005; Heinemeyer 
et al., 2006; Patrignani, 2016). (Deflected) mirage mediation (e.g., Everett et al., 2008) 
combines some of these possibilities. Z’ mediation is mentioned in Section 10.3. 


10.2.5 The Minimal Supersymmetric Standard Model (MSSM) 


It is straightforward in principle though complicated in practice to write the Lagrangian 
density for the MSSM. Here, we will show the main features, but for the details of the 
squark, slepton, neutralino, and chargino mixings, CP phases, etc., the reader is referred 
to the reviews and books that have already been cited. We will follow a notation similar 
to the example in (10.104), using upper case letters for left-chiral superfields and upper 
case letters with a superscript © for the left-chiral conjugates of right-chiral fields. Thus, the 
quark and lepton superfields will be written 


Q= ( A ) U? D; L= ( - ) N° E’, (10.143) 


respectively.” (Some authors use a hat notation, such as Q, to indicate a chiral superfield.) 
Their scalar components will be written as, e.g., 


i=t,, Wau =ù} (10.144) 
for U and U*, and the fermion spinor components as 
Eu © UL; Nu uy [Mu H uR]. (10.145) 


We will usually not display family indices, or superscripts ° for weak eigenstates, as in (8.4) 
on page 258, but these can easily be added. The Higgs chiral supermultiplets are 


H+ H9 
T Hô Ji Ha = ( He i, (10.146) 


with scalar and fermion components such as h} and ht + ht, for Ht. The indices for 
SU(2) doublets are contracted using €??, just as for the spinor indices of the Lorentz group. 
Unlike the spinor case, however, the order is significant since chiral superfields commute. 
For example, 

QH,, = Qa Hu = UHL — DHT = -—H,Q. (10.147) 


SU(8) color indices are unambiguous, e.g., QU = QgU™ or UDES: = eap UDP SeT. 
The superpotential for the MSSM (assuming a conserved R-parity) is 
W = uH, Hy —T¢QH,D° + I"QH,,U° —T°LHyE* + T’LH,N° 
= (H} H} — H?H$) -T° (UHZ — DHÌ) D° +T” (UH) — DHT) U" 
-T° (NH; — EHj) E° +T” (NH) — EHX) NS, (10.148) 
where the first term yields supersymmetric masses u for the Higgs and Higgsino fields, and 


the others yield the Higgs Yukawa vertices and their supersymmetric partners. The signs 
are chosen so that the fermion mass terms have the “correct” signs for positive Tih} 4) 


38We have included the right-handed neutrino fields N° as an (optional) part of the MSSM, similar to 
our definition of the SM in Section 8.1. 
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Family indices such as in (8.13) are suppressed. The kinetic energy and gauge interactions 
are derived in an obvious way from (10.127) and will not be displayed explicitly here. 

The effective soft breaking terms for the MSSM include mass-squared terms for the 
scalars, Majorana mass terms for the gauginos, a scalar analog of the u term (the Bu term), 
and scalar cubic interactions (A terms) similar to the superpotential Yukawa couplings. 
Thus, 


z= ~ 3 ~ ~ z= 
Lsoji =- molo, — T (BB + BB) - =H (wiw + wi’) 
r {=l 
mA Š =.= 

_ Mme Ai Fri Fai Pi _ dx) Fe 

f (ee + GiG’) + | Buh,ha + Aal ähad 
—A,T“Ghyi? + Aer Ühaë: — ALT’ 6h, + hc l (10.149) 
where ¢ = 5 = = } and the same conventions for the SU (2) indices hold as 


in the superpotential. In the first term, }>,, sums over all of the scalars (Higgs, squarks, 
sleptons). The sum could be generalized to allow Hermitian mass-squared matrices for 
each scalar sector (q, u°, d°, etc), but without loss of generality one can choose family 
bases in which they are diagonal (this does, however, fix the family bases for the Yukawa 
matrices in (10.148).) Au, Ag, Ae, and A,, which are matrices in family space, and B have 
dimensions of mass. We have extracted factors of w and the Yukawa matrices I from the 
coefficients, because this occurs naturally in some (but not all) models of supersymmetry 
breaking/mediation.°? The special case in which the A’s are each multiples of the identity 
corresponds to minimal flavor violation (Section 8.6.6). Equation (10.149) is almost“? the 
most general soft breaking allowed for the Rp-conserving MSSM. 

In general (10.149) contains many free parameters after including family indices, but 
specific models, motivated by theoretical considerations, phenomenological constraints (e.g., 
from FCNC and CP violation), or simplicity typically have fewer. Most of the early phe- 
nomenological studies were motivated by supergravity mediation. For example, the con- 
strained MSSM (CMSSM) or minimal supergravity (mSUGRA) models assume universal 
values for the soft parameters at the Planck or GUT scale, 


mg = My =Me=M1/2, Mr = Mo, A, = Ag= Ae = A, = A, (10.150) 


where the universal A parameters multiply the full Yukawa matrices for each sector. It is 
usually further assumed that the soft parameters and p are all real, so that altogether there 
are only 5 dimensionful parameters, m1/2, Mo, A, B, and u. Some specific models further 
relate the universal soft breaking parameters m1 /2, Mo, and A, as well as B, to each other 
and to the gravitino mass m3/2 (see, e.g., Martin, 1997; Patrignani, 2016). (Some authors 
reserve the term “mSUGRA” for some of these more restricted versions.) In any case, these 
parameters must then be run down to the electroweak scale (e.g., Chung et al., 2005), which 
induces mass splittings and flavor off-diagonal couplings to the mass eigenstate fermions. 
Two of the parameters, u and B, are then typically traded for Mz and tan 8 = (h9)/(h9). 


39There is no uniformity in the literature about the signs of the A terms or whether to extract u and I. 

40 An explicit Dirac mass term for the Higgsinos can be absorbed in yu as long as By is free. One could add 
non-holomorphic cubic terms involving the “wrong” Higgs field, such as Gio2hl,)de, but in most symmetry 
breaking/mediation schemes these are suppressed by an additional factor of Msoft/M and are therefore 
negligible. [A possible application for small Dirac neutrino masses is considered in (Demir et al., 2008)]. 
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These are actually derived quantities, but Mz is known and tan is closely related to 
observables. The basic parameters are then m1/2, Mo, A, Mz, and tan £, as well as the sign 
of u, which is not determined, as well as the gauge couplings and fermion spectrum. 

The CMSSM is extremely simple and is a useful benchmark, but it is likely that the real 
world is more complicated (assuming low energy supersymmetry exists). The nonobservation 
of evidence for supersymmetry in the early LHC running and also the heightened theoretical 
realization that there are really an enormous number of possibilities have led to considerable 
interest in more general possibilities, such as extending the CMSSM to allow Higgs masses 
at the Planck scale that differ from those of the squarks and sleptons. There have also been 
extensive studies of models motivated by mediation methods other than supergravity, such 
as gauge mediation and the other alternatives mentioned in Section 10.2.4, which can have 
very different spectra. 

Even these more general frameworks leave open the possibility that some interesting 
cases might be missed. It is not really feasible to examine the entire > 124-dimensional 
parameter space. However, the possibilities are reduced to a manageable level by making 
some reasonable simplifying assumptions in the phenomenological MSSM (pMSSM) (e.g., 
Djouadi et al., 2007; Berger et al., 2009), ic. no R, violation; no new CP violation or 
FCNC at tree level; degeneracy of the squarks and sleptons of each type between the first 
two genrations; and that A terms for the first two generations can be neglected. There 
remain 19 new observable parameters (ignoring N°): 

tanB, Ma, M, Mawes Lee ee Mas e be hze At br, (10.151) 
2 ~ « 
qe de iee 


squares for the first two families, and m z, refer to the (E b)T and (i, #)T doublets. 
3,43 


Another useful framework for analyzing experimental data (or for exploring the sensi- 
tivities of an experiment or analysis method) are simplified models (e.g., Alves et al., 2012). 
These are effective Lagrangians involving just a small number of particles and parameters 
that describe the essence of some process found in a more complicated model, such as the 
production of a gluino followed by its decay into jets and an unobserved LSP. Simplified 
models can also be parametrized in terms of quantities closely connected to what is actually 
measured, such as masses, cross sections, and branching ratios. They can be applied to any 
type of new physics, not just supersymmetry. 


where My is the Higgs pseudoscalar mass in (10.157), m are the scalar mass- 


The Higgs Sector 


The scalar potential V (hu, ha, q, l ae, de, De, é€°) includes F-term, D-term, and soft contri- 
butions,*! 
V = Vr + Vp + Vooft- (10.152) 


We will assume that the minimum of V does not violate color, electric charge, B, or L, i.e., 
that the VEVs of the squark and slepton fields are zero.*? Keeping only the Higgs doublets 


41For more detailed discussions of the Higgs sector of the MSSM and extended supersymmetric models, 
see (Gunion and Haber, 1986; Gunion et al., 1990; Carena and Haber, 2003; Djouadi, 2008b; Accomando 
et al., 2006; Patrignani, 2016). 

“2This is not automatic and leads to restrictions on the soft parameters, especially the A parameter 
associated with the top quark Yukawa (e.g., Casas et al., 1996). 
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in V, the contributions are 


Ve =n)? (lhal? + [hal’) 


2 2 12 2 
Vo =< |h F hu + hiha] +7 [lhu]? — Ihal?]| 
peg 2 g ; (10.153) 
= hul? = hal?) + [Ratha — huha 
Veoft =m? |hul? + m? lhal? — [Bu (nono — hh7) + h.c], 
where 
hul? = |h? + IASI, hal? = hS + lrg’, (10.154) 


we have used the Fierz identity in Problem 1.1 to rearrange Vp, and have assumed the 
U(1)y FI term is zero. One can make a field redefinition so that By is real and positive, 
and an SU (2) x U (1) transformation so that (h*) = 0 and v, = V2(h°) is real and positive. 
Vp is then minimized for (hz) = 0, while the By term is minimized for vg = V/2(h9) real 
and positive. That is, electric charge and CP conservation in the Higgs sector are automatic 
at tree-level in the MSSM, provided the squarks and sleptons do not have VEVs.*? This is 
to be contrasted with general extensions of the SM involving additional Higgs multiplets 
which do allow electric charge violation (Problem 8.3 and 8.4). 
The potential in terms of 1,4 is 


1 1 
V (vu va) = 5m2v2 + Smr? — Bpvuva + LL (v2 13)” 
2 2 32 
a Py oo a (10.155) 
= 5 (Yu Ya) M (z) 32 (v3 — vi) ’ 
where 
2 
2), 12 2 21, )2 2 Oz mg —Bu 
m= jum, m= +m, M= ey a) (10.156) 


and it is understood that Vu, are real and positive (or at least nonnegative). We see that 
at tree level V depends only on three parameters, m?, m3, and By (as well as known 
gauge couplings). One combination of these will be fixed by the requirement that v? = 
v2+v2 ~ (246 GeV)? at the minimum. All aspects of the tree-level Higgs sector are therefore 


expressible in terms of the two remaining parameters, which we will take to be 


M2 = m2 +m? and tang = —. (10.157) 
Vd 


The quartic term in V vanishes for vy, = vg, so vacuum stability requires 
m? + m2 > 2Buy. (10.158) 


Electroweak symmetry breaking requires that the origin is not a minimum, i.e., that one of 
the eigenvalues of M? is negative (the vacuum stability condition (10.158) does not allow 
both to be negative), so 


detM?<0 => mm < (Bu). (10.159) 


43C P violating VEVs may be induced by loop effects in the effective potential associated with the Yukawa 
and soft couplings. 
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If (10.158) and (10.159) are satisfied, one can easily minimize V to obtain 


2 2 2 
| 2 M2 m}, tan B- Mas 


2 tan? 8-1 
2Bu  2Bp 
m2+m2 M2’ 


(10.160) 
sin 26 = 


where M? = (g? + g’)v?/4. (See Problem 3.35 for a very similar calculation.) The first 
equation relates v? to the other parameters. We have separated |u|? from the soft masses to 
emphasize that electroweak breaking involves a relation between u and the soft parameters 
that often requires a nontrivial fine-tuning. Note that 0 < sin28 <1 (0 < 8 < 7/2) by our 
phase conventions and vacuum stability. Most radiative breaking schemes lead to mi, <0 
(or at least m: < m2) because of the large top- Yukawa that drives mi, to lower values 
at low energy, and tan 8 > 1. The latter is also favored on phenomenological grounds. A 
non-zero By is required for both doublets to acquire VEVs, as is needed to generate all of 
the fermion masses. 

It is straightforward though somewhat tedious to expand the potential around the min- 
imum to find the Higgs mass eigenstates and eigenvalues (see Problem 3.35) and to write 
their Feynman rules, so we will only give the main results. We first write the physical neutral 
fields in terms of Hermitian components by 


ie mo V2hit p, — | { Yat har + thar 
oa va thun+ iter J’ a 2 V2hy 


5 + 
ha = ir? hh, = = ( v2hg : ) 
V2 —vqa — har + ihar 


In the last form, the tilde refers to the SU (2) conjugation and not a superpartner, and 


(10.161) 


ht = (hz). The mass eigenstates consist of two neutral (CP even) scalars h and H (with 
the convention Mp, < My), one neutral (C P-odd) pseudoscalar A, and one charged pair 
H=, as well as three Goldstone bosons z and w~. The latter are analogous to those in the 
SM (Equation 8.85) and disappear in the unitary gauge. A does not mix with h and H 
at tree level, but C’P-violating loop effects can induce mixing between all three. The mass 
eigenvalues are 


Mi=mZ+m3, M3. = M3 + M8 


1 
Miu =5 [aa + Mz F Van + M2)? — 4M2 M2 cos? 26 | , 


(10.162) 


2 
so that M? + Mj, = M4 + M3. One also finds the very important (tree-level) constraint 
M? < M2 cos? 28 < MZ, (10.163) 


with the first inequality saturated for Ma > Mz. This is to be contrasted with the SM, 
where in principle there is no upper limit on the Higgs mass. For large stop masses there 
are important radiative corrections to this tree-level result, dominated by the top/stop 
loops in Figure 10.3. At one loop, the upper limit in (10.163) is replaced (under reasonable 
assumptions) by (Haber and Hempfling, 1991; Okada et al., 1991; Ellis et al., 1991) 


39°m} M2 X? X2? 
M? < M2 cos? 26 4 t-i S J pei, t 10.164 
De ee 2p 8n2Mz, | \m? ) © M3 12M2 ) |’ ( ) 
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where MZ = 3(m? +m?) is the average of the masses of the two stop mass eigenstates 


(mixtures of tz and tr), and X; is a stop mixing parameter that will be defined in the 
discussion of sparticles below. The largest value occurs for maximal mizing, X; = V6Ms, 
while X; = 0 is the no mizing scenario. Still higher-order corrections reduce the limit 
somewhat, yielding M, < 135 GeV“* for maximal mixing and Msg in the several TeV range, 
while Mp < 125 GeV for no mixing (see, e.g., Hahn et al., 2014; for a recent review, see 
Draper and Rzehak, 2016). In both cases, the upper limit is for large tan 8 = 5 — 10. 

Such large masses are not typical, however, with most of the MSSM parameter space 
yielding Mp well below 125 GeV. One needs*° either large stop masses in the multi-TeV 
range or (unexpectedly) large mixing (e.g., Hall et al., 2012; Patrignani, 2016). The smaller 
electroweak scale v, which is closely associated with the supersymmetry breaking parame- 
ters, requires cancellations that are possible but appear rather unnatural (the little hierar- 
chy problem). Of course, this scenario (or the absence of supersymmetry altogether) is also 
suggested by the nonobservation of superpartners during early LHC running. 

The mass eigenstate fields are 


h _ (cosa —sina huR 
( H ) E a ( har ). (10.165) 


where the mixing angle a is given by 


M?(M2 — M?) 
2 neg h 
cos*(3 — a) = i (10.166) 


with —r/2 < a < 0. The massive pseudoscalar and charged Higgs fields are 
A = cos Bhur + sin Bhar, Ht =cosBht+singht, H` =(H*)', (10.167) 


while the Goldstone bosons are the orthogonal combinations. A simple way to derive (10.167) 
(and also (10.163)) is to introduce a new basis 


h _ [(cosB -sing hy 
( = ) E a a) ( hig ). (10.168) 


By construction, (h?) = 0 and (h?,) = V2v, so the massive A and H* are associated with 
hy, while the Goldstone bosons are in h;;. In unitary gauge 


1 V2H+* 1 0 
uaa hrr = -5 10.169 
A A a II Peda] ( ) 
Combining (10.165) and (10.168), the CP-even components of hz zy are 
hır \ _ (cos(8—a) sin(8 — a) h 
( hire ) 7 Ge cos(3 — a) H J’ (10.170) 


The hz zr basis is very convenient for deriving the Feynman rules for the gauge-Higgs 


44The bound is increased to Mp < 150 — 200 GeV in extensions of the MSSM that remain perturbative 
up to the Planck or GUT scale, such as some of the singlet-extended models (Section 10.2.6) or other gauge 
extensions (Kane et al., 1993; Espinosa and Quiros, 1993; Batra et al., 2004; Accomando et al., 2006). 

45We are assuming that the observed 125 GeV state corresponds to the h. However, the possibility that 
it corresponds to the heavier MSSM scalar is not entirely excluded (e.g., Bechtle et al., 2017). 
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interactions. hyrp has the same gauge interactions as the SM H in (8.39) on page 265 and 
Table 8.1. For example, the induced SM HVV (V = Z,W) vertices in (8.39) become 


Me MB 
Lavy > (2—“werw, + —224z,, [sin(s —a)h+cos(B—a)H]. (10.171) 
V V 


There are no induced AVV or H#WFZ vertices, since A and H= are associated with the 
other doublet. One can similarly read off the contributions of hz; to the four-point vertices. 
hī is a second scalar doublet unrelated to SSB. It leads to new hAZ, HAZ, hH*=WT, and 
HH*W* interactions, as well as additional contributions to the four-point vertices. These 
can be read off from the SM Rẹ gauge interactions in (8.89), with the substitutions z > A, 
w+ + H+, H > hrr, and v —> 0. (All of the rules are given in detail in Gunion et al., 


1990). We give only one example here, 


LrrpAZ = = [a 0" (cos(3 —a)h — sin(8 — a)H)] Zus (10.172) 


with gz = (g?+g’)'/?, which leads to the transitions Z + hA and Z > HA, where the Z 
can be virtual. These have no SM analogs.*° 

The Higgs Yukawa couplings also differ from the SM by the presence of two doublets. 
The quark terms derived from (10.148) are (the leptons are similar) 


—Lyup = —I4 (dpuphy — dpdyh}) +T” (ugurh?, — trgdyht) + h.c., (10.173) 


which is to be contrasted with (8.44). Therefore, the Yukawa matrices defined in (8.46) are 
modified to 


lm M* M" r? M? M’? 


hY = — = = h? = = . 10.174 

V2 Vu vsin 8’ V2 Va v cos 3 ( ) 
The Yukawa couplings of t and b to hur and har become 
Mt Mb 

hi = hy = 10.175 

t ysin B’ > peos B’ ( ) 


respectively, rather than the SM values m/v. Since m; is large, h; would diverge (generate 
a Landau pole) below the Planck or GUT scale unless sin 8 is sufficiently large, suggesting 
tan 8 = 1.7 (Dedes et al., 2001). hẹ is small in the SM, but can be considerably enhanced 
in the MSSM for large tan 8. In fact, it equals h+ for tan 8 ~ 45. If we keep only the t 
and b Yukawa couplings and use (10.165) and (10.167), the Yukawa couplings to the mass 
eigenstate Higgs fields are 


By (cosah + sina H) + e ph (— sinah + cosa H) 
vsin f v cos 3 
ji— ttt A— i tan BOYD A (10.176) 
v tan 8 V 
2 = 2 7 
= | vV2m trb, Ht + Vane 5 on B Bet, HH + h.c]. 
vtan 8 v 


46The Z* — hA process was important prior to the discovery of the 125 GeV state. The LEP 2 lower 
limit on Mp, from Higgstrahlung (e7 et —> Z* — Zh) was weaker than the corresponding SM limit of 114.4 
GeV because of the sin(8 — a) factor in (10.171). However, the complementary process (Problem 10.10) 
eet — Z* — Ah, with an amplitude proportional to cos( — a), was important for light enough h and A. 
By combining the channels, one obtained Mp, 4 Z 90 GeV assuming CP conservation (Schael et al., 2006b). 
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As expected, h and H couple as scalars and A as a pseudoscalar. (10.176) is easily extended 
to include the 7, the lighter fermions, and fermion mixing. For large tan 8 the couplings of 
A and of H and/or h to bb are enhanced, so that at high enough energies the associated 
production of A, H, or h with a b, e.g., via bG —> b > (A,H,h)b, becomes important 
at the LHC. The combination of the u term and Ls of; may lead to loop-induced flavor- 
changing neutral Higgs vertices (e.g., Hamzaoui et al., 1999; Gorbahn et al., 2011), which 
are especially dangerous for large tan 8. 

A very important special case is the decoupling limit, Ma >> Mz, in which H, A, and 
H= form a degenerate heavy doublet and h acts like the SM Higgs. From (10.166), one has 
cos(3 — a) — 0, i.e, a + 8 — 3. From (10.171), (10.172), and (10.176) h has the couplings 
of the SM Higgs, and the first tree-level inequality in (10.163) is saturated. 


Squarks and Sleptons 


The squarks and sleptons acquire masses from a number of sources. Consider the wz, pr for 
a single family. The relevant mass terms in (10.152) are 


V (tir, tir) = |r“a2|’ (lùr? + lal?) + |-ung + Pana) 


+ ba, |r|? + ba, |r|? (10.177) 
+ mi, lür? + mi,,|@r|? + (AT hürühk + h.c), 
where the three lines represent Vr, Vp, and Vso ft, respectively. The coefficients ĝa, g in Vp 


are 
Jal? — (hul? 


a [tb — sin? Oy Mina] + cos 26M2 cae — sin? Ow täna] , (10.178) 


where t3, (t3) = 5 (0), and gu, , = 3 are the third component of weak isospin and charge. 


Replacing |[“h°| > mu, the 2 x 2 mass term is therefore 


2 2 4 : 
rp. — fat xt mī + Ma + Oa, XúMau UL 
2.= Q ah) ( ae E Se (10.179) 
where 
Xu = Ay — p* cot b. (10.180) 


Similar expressions hold for d, v, and č, except the appropriate values of t? and q must 
be inserted, and cot 8 — tan for d and č. One can easily extend the discussion to allow 
mixing between the families, in which case the mass-squared matrix in (10.179) would be 
6 x 6. Most consideration has been given to models in which such interfamily mixings are 
minimized (by choosing or generating family universal boundary conditions at the GUT or 
messenger scale) in order to suppress FCNC. 

In most realistic scenarios the soft mass-squares m2 Fon dominate. The supersymmetric 


(but SU(2) x U(1) breaking) m? p terms are negligible except for the stop, and to a lesser 


extent the sbottoms and staus; the 6; fur contributions are also small but need to be included. 


The Xysmy terms lead to mixing between fr and fr. These are also small except for the 
third family, especially the stops. The latter may be split significantly for large enough X+, 
and could even lead to an unphysical unstable vacuum with a negative me for large Aj. 
The significance of X; for the lightest Higgs mass was discussed below (10.164). 

In the simple minimal supergravity and gauge mediation scenarios, the squarks are 
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usually heavier than the sleptons because of RGE running and/or because of their stronger 
coupling to messengers. One exception is the lighter stop eigenvalue, which can be the 
lightest sparticle for large enough X+. 

Squarks and sleptons can be produced efficiently in pp or pp scattering in the decay 
chains of gluinos, if they are heavier, or of other squarks/sleptons. Squarks can also also be 
produced directly by QCD processes (including t and u-channel squark or gluino exchange), 
such as GG > qq‘, qq > aq‘, qq > ğğ, or Gq > GG (the relevant cross sections for these 
and other processes are collected in the Cross Section article in Patrignani, 2016). Third 
family squarks have lower production rates because of the smaller number of corresponding 
quarks in the proton. For example, the production of ùz r by valence u quarks through 
t-channel gluino exchange is very efficient for pp, but negligible for t. The cross sections 
for the direct production of sleptons in hadronic processes are mainly due to s-channel 
Drell-Yan (W=, Z, y) exchange, and are much smaller. 


Gluinos 


The eight gluinos G* can acquire Majorana masses mg from Lsoft in (10.149). Gluinos can 
be produced by ordinary QCD processes, as well as by processes such as t-channel squark 
exchange in qq scattering. The gluino can decay into a quark and squark, G — qq‘ or qq, 
where the g may be real or virtual. This initiates a cascade of decays, ultimately leading to a 
number of q, q, G, £, and £ as well as the LSP (e.g., a neutralino or the gravitino/Goldstino) 
in R-parity conserving models, with several jets + Ær being the classic and most obvious 
signature (for an early study, see Hinchliffe et al., 1997). Since the G is Majorana, conjugate 
pairs of decays such as = qqe* Ve+ LSP and GS qqe” De+ LSP occur with equal rates (up 
to CP violating effects), so pair-produced gluinos should lead to the same number of same- 
sign charged leptons as opposite sign. From (10.134) the gluino-quark-squark interactions 
(for a single family) are 
i ~. 
Laqg =- V2gs a X Palio Gry PLGmiR 

(10.181) 


Hits ura — dh GuPran 


where Gi, = Gi, + G's, and we have inserted the Pr r to make clear the chirality. 
Equation (10.181) is easily extended to three families, provided one interprets the quark 
and squark fields as weak eigenstates. When the interaction is rewritten in terms of mass 
eigenstate fields, the vertices will in general be flavor-changing due to a (probable) mis- 
match between the quark and squark unitary transformations, much like the generation of 
the CKM matrix due to a mismatch between the u and d type quarks. This can lead to po- 
tentially dangerous FCNC effects (Gabbiani et al., 1996; Misiak et al., 1998; Altmannshofer 
et al., 2010; Arana-Catania et al., 2014) due to diagrams such as those in Figure 10.7, 
unless the squark masses are nearly degenerate, the quark and squark mixing matrices 
are approximately aligned, or the squarks are very heavy (multi-TeV). The gluino vertices 
are especially problematic because of the strong QCD coupling, but there are analogous 
problems involving neutralino and chargino vertices and the sleptons (Section 9.6). 


Neutralinos, and Charginos 


From (10.148) and (10.149) we see that the neutral Higgsino pair Oe have a Dirac mass 
—u, while the neutral gauginos B and W® have Majorana masses m g and my,, respectively. 
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Figure 10.7 Typical gluino-quark-squark diagrams leading to FCNC effects. Left: a 
new contribution to the Kz — Kg mass difference. Right: a contribution to b > 
s + (y, Z,G). There are analogous diagrams involving neutralinos, charginos, and 
sleptons. 


In addition, the gaugino-Higgsino-Higgs interactions from (10.133), 
+ ~ agi ~ t iT +B; t B- 
-L = V2g |h W hu + hi W "ha + V2g' |h} hu — his ha| + hc, (10.182) 


lead to gaugino-Higgsino mixing when the neutral Higgs fields acquire VEVs. The relevant 
terms are 


1 a oe 
-L=5 [- gva ÑR? + guaW ho + g/ v4, Bho — g'vaBh| + h.c., (10.183) 


leading to the 4 x 4 Majorana neutralino mass term 


ma o te te 
1s gror 0 m č% -F| w 
ee 5 (2 W? hy no) h.c. (10.184 
xe 2 d u -1% Wa 0 =p ho + C ( ) 
cee oo u 0 ho 


Equation rer implies four Majorana mass eigenstate neutralinos, ¥2,r = 1---4, with 
masses Mg 
es the charged Higgsinos ht and hg have a Dirac mass +p, while W! and W? 


can be combined to form a Dirac pair W+ = (W! + W?)/\/2 (cf. Equation 8.28). The 
charged Higgsinos and gauginos are mixed be (10.182), leading to the 2 x 2 Dirac mass 


matrix d 5 
-n ma BN wW 
-Eus (wt hg) a RI a eie (10.185) 
Zo £P hq 


The mass eigenstate Dirac charginos Xj. have mass eigenvalues 


1 
mee =5 (Imgl? + lul? + 2M8) (10.186) 


1 1/2 
+5 ((myl? + ll? + 2M4% )? — Amyn — Mẹ sinp?) 
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Some models predict gaugino unification, i.e., that the gaugino masses are in the ratio 


mg:Mmọy:Mp ~ ag:ag:a, ~ 7:2:1, (10.187) 

where 
s 5 gf" 10.188 
a3 =, w= V3 (10.188) 


a, (the GUT-normalized U(1) coupling) is rescaled because yY has the same Dynkin 


index for a fermion family as the SU (2) and SU (3) generators. This occurs, for example, in 
minimal supergravity schemes in which the gaugino masses are equal at the GUT scale, as 
are the GUT-normalized gauge couplings. The RGE equations preserve the ratios. Simple 
forms of gauge mediation lead to the same relation, while anomaly mediation predicts an 
entirely different pattern with the my, the smallest. Unfortunately, the actual measurement 
of the gaugino masses is complicated by mixing with the Higgsinos. We finally comment 
that gaugino mass terms break continuous R symmetries,’ but allow a discrete R-parity. 

The neutralinos and charginos are expected to be produced efficiently at hadron colliders 
through the decays of squarks and sleptons. Initial pairs of squarks and/or gluinos produced 
by QCD processes could cascade decay into multiple jets and leptons, e.g., 


Goa, BoE, Fax. (10.189) 


If the ¥9 is the LSP it will escape from the detector, implying large numbers of cascades with 
multiple jets and leptons and missing transverse energy. Detailed studies of the distributions 
of the number of jets, numbers of same and opposite sign leptons, multiple leptons, lepton 
flavors, etc., should yield information on the spectrum and test the Majorana character of the 
gluinos and neutralinos. If the intermediate particles in a cascade are on-shell, it should be 
possible to determine the masses by kinematic edge techniques (Problem 10.12). Some spin 
information may also be obtainable from the angular correlations of the decay products (e.g., 
Wang and Yavin, 2008). This would be important to distinguish supersymmetry from other 
scenarios with similar cascades, such as some versions of Little Higgs and extra dimensional 
models, as mentioned in the Higgs/hierarchy discussion in Section 10.1. 

Neutralinos and charginos (and sleptons) may also be produced directly, but at a lower 
rate, by Drell-Yan processes, t-channel squark exchange, etc. For example, trileptons plus 
missing energy could result (e.g., Barger and Kao, 1999) from 


Woe, eae’, Boers’. oan) 


In supergravity and anomaly mediated scenarios with R-parity conservation the xX? is 
usually the LSP, typically with a dominant B composition in minimal supergravity or w° 
for anomaly mediation.*® Direct, indirect, and collider dark matter searches have excluded 
much of the parameter space for neutralino dark matter,4? but there is still an allowed 
region with a low spin-independent cross section (Figure 10.2). For a recent discussion, 
see (Baer et al., 2016). In models with a lower supersymmetry breaking scale, such as 
gauge mediation, the LSP is expected to be the gravitino/Goldstino, g3/2. The next lightest 


47The continuous R symmetries can be restored in models with Dirac gauginos, which are extensions of 
the MSSM involving additional gauge adjoint chiral supermultiplets (e.g., Patrignani, 2016). 

48The dominantly we chargino is expected to be somewhat heavier due to loop corrections (Pierce et al., 
1997), allowing WE > Wr. 

49The ðr would be another cold dark matter candidate, but it is strongly excluded by direct CDM 
detection searches. 
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sparticle (NLSP) may be the 9 or it could be a charged (or even colored) particle, such as 
the Tz. The NLSP may decay promptly, with a displaced vertex, or outside the detector, e.g., 
by x? > (93/2; 7293/2; Or hg3/2) or £ —> £93/2. Possibilities for dark matter and cosmological 
constraints from NLSP decays are discussed in (Giudice and Rattazzi, 1999). 


Experimental Constraints on Supersymmetry 


As of September 2016, there has not been any direct experimental evidence for supersym- 
metry, implying that if supersymmetry exists in the low-energy theory it is most likely in 
the decoupling limit, in which the sparticle and extra Higgs particle masses are large com- 
pared to Mz. (Some possible loopholes are mentioned below.) This is also suggested by the 
relatively large value 125 GeV of the Higgs mass. 

In the decoupling limit, the effects of the MSSM on the precision electroweak observables, 
such as the oblique parameters introduced in Section 8.3.6, are small (e.g., Erler and Pierce, 
1998; Heinemeyer et al., 2006; Ramsey-Musolf and Su, 2008; Cho et al., 2011). Similarly, 
the predictions for FCNC processes such as b + sy and B,q — wtp (Section 8.6.6) are 
very similar to those of the SM (which roughly agree with experiment) for relevant regions 
of parameter space, at least in the CMSSM (Buchmueller et al., 2014). However, for heavy 
sparticles the contribution of the MSSM to the anomalous magnetic moment of the muon, 
gu — 2, from diagrams such as in Figure 2.22 on page 73 is also expected to be small, and 
the condition (2.377) for the MSSM to explain the g,, — 2 discrepancy is most likely not 
satisfied. 

The ATLAS and CMS collaborations have carried out a great many supersymmetry 
searches, including analyses motivated by the CMSSM, gauge mediation, R, violation, sim- 
plified models, and the pMSSM. 

There was no sign of the heavier Higgs states predicted by the MSSM or general two- 
doublet models in Run 1 at the LHC. Especially stringent were limits on H or A decaying to 
t+7~, which are strongly enhanced in the decoupling limit for large tan 3 (similar to the bb 
couplings in (10.176)), with tan 8 = 10(40) excluded for M4 ~ 300(800) GeV (Patrignani, 
2016). The limits are weak for smaller tan 3, but it is then more difficult to achieve Mp = 125 
GeV since the upper limit on the tree-level term in (10.163) becomes smaller. 

The experimental limits and future prospects for sparticle masses depend on the mass 
of the LSP; the ordering of the masses, e.g., whether the gluino is lighter than all of the 
squarks; and other model dependent assumptions, so they will not be described in detail 
here. Searches at LEP and the Tevatron are surveyed in (Feng et al., 2010). The ATLAS 
and CMS Run 1 results,°? analyzed in the pMSSM framework, are given in (Aad et al., 
2015c; Khachatryan et al., 2016b), respectively, while general reviews may be found in (e.g., 
Melzer-Pellmann and Pralavorio, 2014; Autermann, 2016; Patrignani, 2016). Roughly, for 
supergravity models with a sufficiently light ¥? the LHC Run 1 was sensitive to masses of 
around 1.2 TeV for gluinos and for squarks of the first two families; 600 GeV for t and b; 
400 GeV for X7, X8; and 300 GeV for charged sleptons. There were also significant limits 
placed on the interaction strength of WIMP dark matter candidates (i.e., X?) over a wide 
mass range from the nonobservation of an associated jet, Z, y, etc. Similar sensitivities were 
achieved for gauge mediation, especially for decays of the NLSP into a photon. Preliminary 
Run 2 results through mid 2016 extended the exclusions by more than 50% in mass. 

The various searches come with many caveats, coverage gaps, and loopholes. For exam- 
ple, limits are weaker for compressed spectra, in which parent and daughter particles are 
sufficiently close in mass to suppress a decay mode, or to reduce visible and missing energies 


50See (Aad et al., 2009; Bayatian et al., 2007) for the respective capabilities of ATLAS and CMS. 
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below a trigger threshold. Thus, limits in supergravity are strongly dependent on the 9 
mass, and are usually presented as exclusions in the mg — mo plane, where p is a super- 
partner. Some limits are presented assuming a 100% branching ratio for a particular decay, 
and can be weakened by having multiple decay modes. (This is one of the motivations for 
a pMSSM analysis, which takes this possibility partially into account.) Most of the limits 
are not directly applicable to models with R-parity violation (below), in which there may 
be no missing Er, and for which separate analyses are needed. Similar comments apply to 
models involving a long-lived particle that decays away from the main interaction vertex 
or outside the detector. Limits may also be weakened in variations on the MSSM. These 
include models with Dirac gluinos, for which the important production channel wu > wu 
is absent, and stealth models, in which a new light chiral supermultiplet drains off most of 
the energy in a decay chain before decaying back to SM particles and a light LSP. 


10.2.6 Further Aspects of Supersymmetry 
R-Parity Violation 


The superpotential in (10.148) assumed a conserved R-parity, a discrete R symmetry under 
which each particle of spin S has Rp = (—1)3(8-4)+?5. All of the ordinary particles (quarks, 
leptons, Higgs, and gauge bosons) have Rp = +1, while their superpartners have Rp = —1. 
R-parity therefore ensures that the LSP is stable and suppresses precision electroweak vertex 
corrections. However, it is possible add terms to W that violate R-parity, 


Wr, ~ Hul , LLE® , LQDo , U°D°D®, (10.191) 
we —“—— a —_— 
Į mixing & dilepton JL \eptoquark B diquark 


where we have suppressed the coefficients (the LLE® and U°D°D¢ terms must be antisym- 
metric in the LL or D°D° family indices). The first three terms violate lepton number, but 
are otherwise allowed because L and Hg have the same SM gauge quantum numbers. The 
fourth term violates baryon number. The LQD° and U°D°D* operators imply leptoquark 
and diquark couplings, respectively, for the squarks (Section 8.6.6), while the čr in LLE* 
is a dilepton (Cuypers and Davidson, 1998). R-parity can also be violated spontaneously if 
(L,R) # 0. 

The simultaneous presence of the leptoquark and diquark couplings would lead to rapid 
proton decay via dé exchange, which would generate a B — L conserving operator @qud. One 
of the two must therefore be absent or incredibly small, but the other would be allowed. 
The diquark operators could lead to neutron oscillations, i.e., n — ñ transitions or nn 
annihilations in a nucleus®! (e.g., Kuzmin, 1970; Calibbi et al., 2016). The various opera- 
tors are also strongly constrained by precision electroweak measurements, FCNC, EDMs, 
and direct collider searches, especially for the first two families (Barbier et al., 2005; Al- 
lanach et al., 1999; DorSner et al., 2016). The H„L operator or sneutrino VEVs would 
lead to neutrino-neutralino (and charged lepton-chargino) mixing, which could give mass 
to one neutrino. The other neutrinos could obtain mass at loop level from the trilinear 
operators (e.g., Grossman and Rakshit, 2004; Rakshit, 2004). 

In the presence of R-parity violation the LSP would no longer be stable, so other candi- 
dates (such as an axion) would be needed for the dark matter. For example, the couplings 


in (10.191) could lead to such decays as ¥9 > 4+4 D l* aq, D ad, or qqq. Such decays can 
be searched for if they decay inside the detector, or would resemble the R-parity conserving 


51The possibility of neutron oscillations is more general than R-parity violation. For a general review, 
see (Phillips et al., 2016). 
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case if they occurred outside. There would also be the possibility of other types of LSP, such 
as a 7 or even a gluino. Depending on the LSP and the flavor structure of the couplings 


many types of decays with unusual signatures are possible, including the ¥? decays and 


others such as ¢ > retu? V, 0, > uet, f > 7+b, and G —> tbs. 


The u Problem 


The u problem was already introduced in Section 10.2.1. A nonzero p is not actually required 
for the Higgs sector of the theory, since the needed terms in the tree-level potential can be 
generated by the soft terms Ming and By in (10.153). However, a significant u is required 
to generate large enough masses for the charginos, as can be seen from (10.186). 

One possibility is to generate u at the same time as the soft terms by non-renormalizable 
operators coupling the observable and hidden sectors (Giudice and Masiero, 1988; Casas 
and Munoz, 1993). To illustrate how this works, it is convenient to introduce a spurion 
superfield X = 6?Fx, with a non-zero F component. A spurion is usually an effective or 
composite field (or superfield in this case) that is introduced with the right transformation 
properties to parametrize the effects of symmetry breaking. 

One typically expects string or supergravity effects to generate allowed higher- 
dimensional operators involving X and the other superfields in the Kahler potential or 
superpotential, presumably associated with inverse powers of the Planck scale Mp. For ex- 
ample, a term -c X XPS, / M?, in the Kahler potential would lead to the soft mass-squared 


term j i N i 
XIX Oh Fy 
=— | d*6c, a= ter 10.192 


with m2 = c,|Fx|?/MŻ2. Gaugino masses and A terms can be vee generated, e.g., a su- 
perpotential term —c123X ©, ®2®3/Mp leads to a corresponding A term C123F'x¢,¢23/Mp. 
In the Giudice-Masiero mechanism one also introduces a term cyX'H,Ha/Mp + h.c. into 
the Kahler potential, which yields a u term 
i 
Fy 


xt n 
W, = f Ee d Z ep -X H, Ha, (10.193) 
Mp 


with u = CuFt /Mp the same order of magnitude as the soft terms. 

In this discussion, we have implicity assumed a supergravity mediation, so that Msoft ~ 
Fx /Mp. In schemes with a lower Fx, such as gauge mediation, such operators are still 
present but are negligibly small compared to Msoft, which is now S Fx /M, where M « Mp 
is the messenger scale. It is also sometimes the case, even in supergravity mediation, that 
there are new discrete or continuous symmetries that allow the operators such as (10.192) 
but not the one in (10.193). 

An alternative solution is to utilize renormalizable operators. In the singlet-extended 
versions of the MSSM*? one replaces u by a dynamical variable, by introducing a SM 
singlet superfield S with a superpotential coupling 


WsH, Ha = AsSAuHa (10.194) 


to the doublets. If S acquires a VEV an effective u parameter Heff = As (S) is generated. 
The singlet extensions usually impose additional symmetries on the theory to forbid an 
elementary term. There are a number of realizations of this mechanism (see Accomando 


52For other possibilities, see (Komargodski and Seiberg, 2009). 
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et al., 2006; Barger et al., 2007; Maniatis, 2010; Ellwanger et al., 2010, for reviews). The 
best known is the next to minimal model (NMSSM), in which a discrete Z3 symmetry 
forbids u but allows the cubic terms Ag SH,Hqa and «KS? in W (Ellis et al., 1989). The 
original form of the NMSSM suffers from cosmological domain wall problems because of the 
discrete symmetry. This can be remedied in more sophisticated forms involving a broken R 
symmetry (Panagiotakopoulos and Tamvakis, 1999b). A variation on that approach yields 
the new minimal model (nMSSM), in which the cubic S? term and its soft analog are 
replaced by tadpole terms linear in S with sufficiently small coefficients (Panagiotakopoulos 
and Tamvakis, 1999a). A U(1)’ symmetry, which is perhaps more likely to emerge from 
a string construction, is another possibility (Suematsu and Yamagishi, 1995; Cvetié and 
Langacker, 1996; Cvetié et al., 1997). This avoids the domain wall problem by embedding 
the discrete symmetry of the NMSSM into a continuous one. The singlet extended models 
involve an additional Higgs scalar and (except for the U(1)’ case) an additional pseudoscalar 
that may be light. The branching ratios (into SM particles) and couplings of the lightest 
Higgs may be reduced by decays into two pseudocalars (Chang et al., 2008) or by an 
admixture of singlet component (e.g., Barger et al., 2006). The theoretical upper limit in 
(10.164) is also increased (to around 150 GeV) by new F term and (for U(1)’) D-term 
contributions to the potential. As mentioned in Section 10.1, the singlet-extended models 
also make it relatively easy to achieve the strong first order electroweak phase transition 
needed for electroweak baryogenesis. 


Gauge Unification 


As will be discussed in Section 10.4 simple grand unified theories predict that the properly 
normalized gauge couplings in (10.188) should all be equal at and above the GUT (unifica- 
tion) scale Mx, above which the symmetry breaking can be ignored. Such unification may 
also occur in some superstring theories that compactify directly to the SM or the MSSM. 
Unification can be tested using the observed gauge couplings at Mz and the known RGE 
equations for the gauge couplings. From (5.36) on page 171, at one loop, one has 


1 1 Q? 
= 4nb; In“, 10.195 
a;(Q?)  ai(M3) i Mz l ) 
where 
Al 33 
2 1 10 1 5 
b — —19 H 10.1 
2 1672 6 OF Ier? aoa) 
b3 — —3 
SM MSSM 


in the SM or the MSSM, respectively. There were early indications that unification was more 
successful in the MSSM than in the SM (Dimopoulos et al., 1981; Amaldi et al., 1987), but 
precise tests became possible (Ellis et al., 1990; Amaldi et al., 1991; Langacker and Luo, 
1991; Giunti et al., 1991) after the gauge couplings at Mz were determined accurately at 
LEP. (In practice, one needs to include two-loop effects; corrections or uncertainties associ- 
ated with the t, Higgs, sparticle, and GUT particle thresholds; and a conversion of the MS 
gauge couplings to the dimensional reduction (DR) scheme? appropriate for supersymme- 
try (Langacker and Polonsky, 1993, 1995; Carena et al., 1993; Bagger et al., 1995).) As can 
be seen in Figure 10.8, the couplings do not precisely unify when extrapolated assuming 
the SM, but do approximately meet at around Mx ~ 5 x 1016 GeV when the effects of the 
new particles in the MSSM are included in the 6 functions. 


53The DR scheme is similar to dimensional regularization except that the Dirac algebra is evaluated in 
four dimensions. 
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Figure 10.8 Extrapolation of the observed gauge couplings in the standard model and 
MSSM. The lower plot assumes that the new MSSM particles have mass around 
5 TeV. The extension above Mx assumes supersymmetric SU (5). 


10.3 EXTENDED GAUGE GROUPS 


There have been many proposed extensions of the gauge symmetry of the standard model or 
of the MSSM. One class, which is the focus of this section, involves extending the electroweak 
SU(2) x U(1) gauge symmetry. Such extensions are sometimes motivated by bottom-up 
considerations, e.g., solving the u problem of the MSSM in U(1)’ extensions, or restoring 
P and C invariance in SU(2)z x SU(2)r x U(1) models. Another, and perhaps stronger, 
motivation is top-down: such extra gauge symmetries often appear as accidental remnants 
of the breaking pattern of an underlying theory, such as a superstring theory or a theory in 
which new TeV scale physics is responsible for SU (2) x U (1) breaking. 

Another class of extensionsäć involves the introduction of horizontal (or family or fla- 
vor) symmetries, which relate or distinguish between different fermion families and hopefully 
shed light on the spectrum. The horizontal symmetry could be discrete or continuous. If it is 
continuous and spontaneously broken it is presumably a gauge symmetry to avoid unwanted 
Goldstone bosons. FCNC, which can be mediated, e.g., by gauge bosons, are a serious con- 


54For a general discussion, see (Ramond, 1979). Examples include (Altarelli et al., 2008; Luhn and Ra- 
mond, 2008). See Section 9.5.5 for applications to neutrinos. 
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straint on models with family symmetries broken at the electroweak or TeV scale. These 
are not necessarily present in the original weak eigenstate basis. Family transitions at one 
vertex may be compensated at the other, or there may be family diagonal but nonuniver- 
sal transitions. However, the interfamily mixing associated with the transformation to the 
mass eigenstate basis almost inevitably leads to FCNC. Unfortunately, there are no really 
compelling models of horizontal symmetries. 

There have also been various extensions of QCD, such as a spontaneously-broken chiral 
or non-chiral SU(3) x SU(3) group, with the unbroken SU(3) emerging as the diagonal sub- 
group. The massive color octet bosons are referred to as axigluons or colorons, respectively. 
See (e.g., Chivukula et al., 2015) and Problem 5.3. Various strongly-coupled gauge groups 
have also been suggested in connection with electroweak symmetry breaking (Section 8.5.3). 

Grand unified theories, which unify the strong and electroweak interactions, will be 
discussed in Section 10.4. 


10.3.1 SU(2) x U(1) x U(1)! Models 


Many extensions of the SM and MSSM involve additional U(1)’ factors and associated Z’ 
gauge bosons (for reviews, see Hewett and Rizzo, 1989; Leike, 1999; Langacker, 2009; Han 
et al., 2013; Patrignani, 2016). These include superstring theories, grand unified theories, 
and many models involving new TeV-scale physics, such as dynamical symmetry breaking 
and little Higgs models. As a simple example of why U(1)’s so often survive, consider SU(m) 
broken by a real adjoint Higgs, ¢;. Define the m x m matrix ¢ = ¢;L',, analogous to (3.31) 
on page 97. The VEV (¢) can be diagonalized by an SU(m) transformation, which makes 
it clear that the unbroken subgroup is U(1)™ (or a larger group if some of the diagonal 
entries are equal). 

The extra gauge bosons could be extremely heavy, massless or very light, or anywhere 
in between. Here we will mainly consider electroweak/TeV scale Z’s. These are especially 
motivated in supersymmetric models, where the SU(2) x U(1) and U(1)’ breaking scales are 
usually both driven by msof¢. We will also comment briefly on very light “dark photons”. 

In addition, some models involving extra dimensions of space allow the Z and other SM 
gauge bosons to propagate in the bulk, implying Kaluza-Klein excitations with masses of 
order R7! ~ 2 TeV x (107!"cm/R), where R is the scale of the extra dimension. These 
excitations would be similar to the bosons from new gauge symmetries, except that their 
couplings would be essentially the same as the SM ones. 


U(1)! Gauge Interactions 


Consider the case of a single additional U(1)’ factor, with family universal couplings and 
(initially) no kinetic mixing (Section 2.14). The SM coupling of the three neutral gauge 
bosons to fermions in (8.55) on page 271 generalizes to”? 


-Lyo = 9J3W + of Fy Bu + 9259 Zg, = eJb Ap + n IU Ziu + G2F5 Zou, (10.197) 
where Z9, is the new gauge boson, J} is the U(1)’ current, and gz the gauge coupling.”® 


If we were to work in terms of W?, B, and Z9, then following SSB one would have to deal 
with a 3 x 3 gauge boson mass matrix. However, this can be avoided (so long as electric 


55In some cases, the two U(1) currents are linear combinations of Jy and a second current. However, if 
the model contains the SM as a subgroup, it can be written in this form by a rotation of the two abelian 
gauge bosons. 

56The gı and gg used here differ from the ,/5/3g’ and g used in connection with gauge unification. 
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charge is not broken) by first transforming to A and Z? = Z, which are related to W? and 
B by (8.30) and (8.33), i.e., they are the neutral SM gauge bosons. Similarly, Ji’ = J4 /2 
where J? is the SM outreni defined in (8.73), and gı = (g? + g?)!/2 = gz. This second 
form is especially convenient when the mixing with the Z9 is a perturbation. The currents 
J# a = 1,2, are 


JE = Dvr Hl) Pt + lr) Pal = 5 Lieto G(r) ]tr, (10.198) 


r 


where the e} p are the SM couplings in (8.76), and the ef g depend on the U(1)' model. 
The currents take the same form in the fermion weak and mass bases because of our as- 
sumption of family universality. When working in terms of left chiral fermion fields, it will 
be convenient to define the charges 


Qaf = SAGDE Qafe = ent): (10.199) 


for fermion fz and its charge conjugate ff. We similarly define U(1) charges Qa; for a 
complex scalar ¢;, with Qi; = t? —sin? Ow qi. The neutral current part of its gauge covariant 
derivative is therefore 


a=1 


2 
Dydi = Q +ieqgA,+i>— 1:22) Qi. (10.200) 


All members of an SU (2) multiplet must have the same Q% since the two groups are assumed 
to commute. 


Gauge Boson Masses and Mass Mixing 


When some of the scalars acquire VEVs, they will generate masses for the neutral (and 
charged) gauge bosons. Assuming that electric charge is not broken, i.e., qi = 0 for all 
scalars with (¢;) # 0, one finds from (10.200) that the photon A,, remains massless, while 
ZY» develop a mass-squared matrix 


2 297 JP KOD 29192 D2, tFQil (Hi)? ) ( MZ. A? ) 
= = 0.201 
Mz_z E 293 X Qj ll A Mọ J” (aaan 


where Q; = Q2;. Mzo would be the Z mass in the absence of mixing. If the Higgs fields are 
all SU (2) doublets and singlets, then 


M2 
2 _ My =% 2 _ i v?, 
Myo = oe Oy > vi = (10.202) 


where v; = V2(ġi) and v ~ 246 GeV, as in the SM. The mass eigenstates corresponding to 


(10.201) are 
Z \ _ Z? cos sin@ 
( Z2 ) -u ( Z9 ). ve ( wii. cosh la (10.203) 


with eigenvalues 


Í 
Mi2=5 [a + Mz F y M2 - Mz)? +404 ; (10.204) 
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The mixing angle is given by 
1 2A? M2, — M2 
0 = T3 arctan Gz az) ; tan? = ia) A L (10.205) 


An important limit is Mz >> (Mzo, |A|), which typically occurs because an SU(2) singlet 
field S has a large VEV > v and contributes only to Mz. One then has 
4 


A 
M? ~ Mo — a < M3, = M3 ~ Mi, (10.206) 
Zz’ 
ad A? 2 S BQO 
go MÉ . t3Q5| (bi 
O~ ~C with C = —-= 4 10.207 
ME ~ O n M2 DOUE eve) 


C is model dependent, but typically |C| < O(1). From (10.205)—(10.207) one sees that both 
|0| and the downward shift (Mzo — M,)/Mzo are of order M?/M3. 

As an example, consider a complex SU (2) singlet field S and two SU(2) doublets u,a 
or their conjugates hu, a. The doublets are defined as in (10.12) and (10.13) for the MSSM, 
but we are not necessarily requiring supersymmetry. Then, 


1 1 
MZo = 91 (vul? + [va ?), A® = 5 9192(Qulvul” — Qalval”) 
Mz = 93(QilYul” + Qalval? + Qs\s|”), 


with Qua = Qh,,hg and s = V2(S). In the supersymmetric version of this example (e.g., 
Cvetié et al., 1997) one usually assumes that the U(1)’ symmetry does not allow an ele- 
mentary u term A,Hg in W, i.e., Qu + Qa 4 0, but does allow the coupling Ag SH,Hq 
in (10.194), i.e., Qs + Qu + Qa = 0. The potential for S (which denotes the scalar compo- 
nent as well as the superfield), h°, and h® is then V = Vp + Vp + Vsoft, where the MSSM 


expressions in (10.153) are replaced by 
Vr = àz (lal? lhal? + ISP lhal? + 191a) 


(10.208) 


2 2 
g 2,9g 2 
Vp = & (lhal? — Iha)? + F (Qulhul? + Qalhal? + Qs15/?) (10.209) 
Veo ft — mh IRS]? + mi, IRS 2 + m2z|S|? = (As As Shih + h.c.) : 


u and Bu from the MSSM are replaced by per = Ag(S), and (Bu)eff = Ag Ag(S). There 
is no MSSM analog of the first (second) term in Vr (Vp). One can make field redefinitions 
and gauge transformations so that (Byu)erf and the VEVs are real and positive, and it 
is automatic (at least in the tree-level Higgs sector) that the minimum preserves U(1)g 
invariance. (The contributions of h} and hj to V are similar to the MSSM and are not 
displayed.) We see from (10.209) that all of the dimensional parameters in V are given by 
soft supersymmetry breaking terms, resolving the u problem and suggesting that both the 
electroweak and U(1)’ breaking scales should be close to mso ft, up to an order of magnitude 
or so.°’ The new terms in (10.209) increase the tree-level limit in (10.163) for the lightest 
Higgs scalar to 


1 
Mẹ? < M2 cos? 26 + zas” sin? 26 + g2[Qa cos? 8 + Qu sin? 6]?v”, (10.210) 


allowing Mp as large as ~ 150 GeV when higher-order corrections are included (Barger 
et al., 2006). 


57More complicated models involving additional S fields with opposite sign charges may allow intermediate 
scale U(1)’ breaking along flat or nearly flat directions (Cleaver et al., 1998; Erler et al., 2002). 
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Kinetic Mixing 
Kinetic mixing was discussed generally in Section 2.14. For a U(1); x U(1)2 gauge theory, 
the most general gauge kinetic term°® is (Holdom, 1986) 


Ces -2A Pi -2 pomeo 3 P Paw (10.211) 


where Ê 0 uv are the field strength tensors. Without loss of generality, we can rescale the ais 
so that cı = cg = 1 and (since the eigenvalues must be positive) c12 = sin x. The kinetic 
mixing term is not expected initially if the U(1)? is embedded in a simple group, but it 
can be generated at the loop level due to multiplet mass splitting, e.g., it can arise from 
RGE effects if Tr m<u(Q1Q2) # 0, where the notation indicates that the trace is restricted 
to the particles lighter than the scale u (e.g., Babu et al., 1998). It can also come about by 
superstring loop effects (Dienes et al., 1997) or by other constructions. 

Kinetic mixing could provide a means to induce a weak coupling between the ordinary 
sector and a supersymmetric or dark matter hidden sector. For example, a massless Z’ 
mixing with the photon would result in a small fractional electric charge for the hidden 
sector particles (Holdom, 1986). Here we restrict ourselves to the case in which kinetic 
mixing is a small (e.g., RGE induced) perturbation on the effects of a TeV scale U(1)’. 
Suppose one has 
~o 1a ~o sin X 


Ouv 
72 Fouw T 2 


z à 1 « a 4 
E” RD + z7" Mz- gia, (10.212) 


where M2_,, is induced by Higgs VEVs, and has the same form as (10.201). One can put 
Lkin into canonical form using the non-unitary transformation 


ZA, _f 1 —tanx Z9, E z9, 
( 3, ) = ( 0 ifeosx )\ 28, J = ze > (10.213) 


in which case the mass-squared matrix becomes 
M2 og > Mig = V" MG nV, (10.214) 


which must be diagonalized as in (10.203). It is straightforward to show that in the limit 
(Mzo,|A|) < Mz and |x| < 1 the effects of x on M? and 0 are second order and negligible 
(ie., A? —> A? — MZ,x in (10.206) and (10.207)). The only important change is then 
that the neutral current couplings of the ZÌ in (10.197) are shifted to include a first-order 
component proportional to Jj, 


-Lyco > JG Ap t+ 9J} Ziu + (Gedy — gixJt') Zop (10.215) 


The light boson couplings are not affected at this order, and one must still include the 
further effects of mass mixing (the @ rotation). This shift in the Z? couplings is especially 
important because in many models the fermion charges are orthogonal (i.e., Tr (QiQ2) = 0) 
prior to kinetic mixing; it can be used, e.g., to suppress the couplings of the Zə to leptons 
(leptophobic models). 


58 Analogous kinetic mixing terms are not allowed for non-abelian theories. 
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U(1)! Models 


There are an enormous number of possible U(1)’ models, distinguished by the Z’ mass, the 
chiral couplings to the quarks and leptons and whether they are family universal, the ex- 
tended Higgs sector, the possible exotic fields that may be necessary to avoid anomalies from 
the extended gauge sector, possible couplings to a hidden sector, possible kinetic mixing, 
etc. Even assuming electroweak/TeV scale masses, couplings comparable to electroweak, 
and family universal charges there are many models in the literature. Here we list a few of 
the major classes. 


The sequential Zsm boson is defined to have the same couplings to fermions as the 
SM Z boson. This is difficult to achieve in an ordinary gauge theory,” but is a useful 
benchmark. 


The electric charge Q and the weak hypercharge Y = Q — T? in (8.5) can be written 
(at least for the SM particles) as 


Q =T} +Y =T? +T 475, (10.216) 


where TÈ takes the values t = +4 for ve and up; t} = —4 for er and dp; and 
0 for wr. Ter is defined as (B — L)/2, where B (L) are baryon (lepton) number. 
The left-right (LR) models, which can descend from the SU(2), x SU(2)r x U(1) 
models considered in the next section or from SO(10) grand unified theories, involve 
the neutral current interaction 


—LyNc = gFR Wi + gRrI3pWry, + gBLJBWBLu, (10.217) 


in an obvious notation. Anticipating that U(1)3r x U(1) gr will be broken to U(1)y at 
a scale Mz >> Mzo, it is convenient to put (10.217) into the form (10.197) by rotating 
wÈ and Wpgz to a new basis B and z9. This leaves invariant the kinetic energy terms 
(which we assume are canonical). One can take B = cos y Wsp +sin y Wpgz and choose 
y so that B couples to g'Y. One then finds (Problem 10.16) that (8.69) becomes 


=+ =at . (10.218) 


Z9 = sin y W3r — cos y Wpz is associated with the charge 


1 
QE = laren = Ta ; (10.219) 


where a = tany = grR/gBgL = V k2 cot? Ow — 1, with k = gr/g. The coupling has 
been normalized to g2 = E g tanw ~ 0.46. For example, the left-right symmetric 
version of the SU(2)z x SU(2)r x U(1) model (Mohapatra, 2003), breaking at the 
TeV scale, implies gr = g so that a ~ 1.53. Realistic breaking patterns for SO(10) 
typically lead to a ~ 0.7 — 0.9 (Robinett and Rosner, 1982b). The x model in Table 
10.2 coincides with the LR model with a = /2/3 ~ 0.82. Generalizations of the LR 
models, e.g., to J2 = Jg, are described in the reviews. 


The LR models and their generalizations are attractive in that they are the unique 


591t could occur as a diagonal subgroup in a complicated constructions involving new fermions, or as a 
Kaluza-Klein excitation in theories with extra TeV~1 scale dimensions. 
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nontrivial family-universal U(1)’ extensions of the SM that do not require the intro- 
duction of additional chiral fermions (other than vg) to cancel anomalies. However, 
they are perhaps less interesting in a supersymmetric context, because the Higgs su- 
perfields H,, q form a vector pair with T= +3 and Tp, = 0. Therefore, an elementary 
u term in (10.148) is not forbidden by the extra U(1)’. One must also introduce SM 
singlet supermultiplets to break the U(1)’. They would most likely be introduced as 
non-chiral vector pairs to avoid anomalies, leading to their own “ju” problem. (One 
could instead give large VEVs to the scalar partners of the v°, but this would break 
R-parity and would be challenging for neutrino phenomenology.) 


Many Z’ studies focus on the two extra U(1)/s that occur in the decomposition 
of the Eg GUT (Robinett and Rosner, 1982a; Langacker et al., 1984; Hewett and 
Rizzo, 1989), i.e., Eg > SO(10) x U(1)y and SO(10) + SU(5) x U(1)y. We con- 
sider them only as simple examples of anomaly-free U(1)’ charges and exotic fields, 
and do not assume a full underlying grand unified theory.°° In Eg, each family of 
left-handed fermions is promoted to a fundamental 27-plet, which decomposes under 
Es > SO(10) + SU(5) as 


27 + 16 +10 +1 — (10+ 5* +1) + (64+ 5*) +1, (10.220) 


as shown in Table 10.2. In addition to the 15 fermions charged under the SM gauge 

group and the v°, each 27-plet contains a second SM singlet, the S. Both the v° and 

the S may be charged under the U(1)’. There is also an exotic color-triplet quark D 

with charge —1/3 and its conjugate D°, both of which are SU(2) singlets, and a pair 
+ 0 

of color-singlet SU(2)-doublets, hy = ( y ) and ha = ( ha ) with Yh, a = 1/2. 
u d 

The exotic fields, which are needed to cancel anomalies, are all singlets or non-chiral 

under the standard model, and therefore do not yield large corrections to the precision 

electroweak observables. However, they are usually chiral under the U(1)’, i.e., they 

are quasi-chiral. 


TABLE 10.2 Couplings of the Z9, Z$, Z9, and Z9, to an Ee 27-plet of 
left-chiral fermions or superfields. 


SO(10) SU(5) 2/10Q, V24Qy 2V15Q, 2V10Qn 


10 (u,d,u°, er) —1 1 —2 1 
16 5* (d°, v,e”) 3 1 1 2 
1 ve —5 1 —5 0 
5 (D,hu) 2 -2 4 = 
10 5* (D°,ha) -2 -2 1 -3 
1 1 S 0 4 —5 5 


The Eg models can be considered in both non-supersymmetric and supersymmetric 
versions. In the supersymmetric case, the scalar partners of the S (or the v°) can 
develop VEVs to break the U(1)’ symmetry. Similarly, the scalar partners of one hia 


60Tn a full supersymmetric grand unified theory with a TeV-scale U(1)’ the scalar partners of the D and 
D° would have Yukawa couplings related to those of the Higgs, and could therefore mediate rapid proton 
decay. 
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pair can be interpreted as the two Higgs doublets of the MSSM. The two additional 
hu, a families may be interpreted either as additional Higgs pairs or as exotic-leptons 
(ha has the same SM quantum numbers as an ordinary lepton doublet, while hu would 
be conjugate to a right-handed exotic doublet). 


U(1)y x U(1), may survive to low energies, though most studies assume that only 
the U(1)y, the U(1),, or a U(1) associated with a linear combination of their charges 


does so. Examples of linear combinations include Q} = E Qy- TE Qy, which occurs 


in some heterotic string compactifications, or Qy = +Qy + VE Q,,, which allows 
a neutrino seesaw with a large v° mass (Ma, 1996) or can avoid cosmological and 
astrophysical constraints on Dirac neutrinos (Barger et al., 2003b). 


Except for the x model, all of these U(1)’s forbid an elementary u term. They allow 
all of the other (Yukawa) interactions in the MSSM superpotential in (10.148), as well 
as the terms SH, Ha and SDD*, where H, a are the Higgs/exotic lepton superfields, 
and D and D° represent the superfields as well as the fermions. Therefore, the U(1)’ 
breaking induced by (S) can also generate masses for the Higgs and exotic fields. 
Leptoquark or diquark operators, such as LQD* or US D°D®, which can allow D decay 
and also FCNC, are also allowed by the U(1)’. (They cannot both be present because 
they would then lead to rapid proton decay.) One problem with the Eg models is that 
in its minimal form the particle content is not consistent with the minimal MSSM-type 
gauge coupling unification. This can be restored by adding a vector pair of Higgs-type 
doublets (e.g., an additional H,, + H£) from an incomplete 27 + 27*, but only at the 
cost of introducing a p-type problem for this new pair. 


The supersymmetric LR and Eg models all either involve a vector pair of chiral super- 
fields, therefore leading to a version of the u problem, or else are not consistent with 
the simple form of gauge unification found in the MSSM. The minimal unification 
models (Erler, 2000) remedy this by starting with the MSSM particle content and 
then choosing exotics in sets that preserve the MSSM unification at tree-level, with 
U(1)’ charges chosen to cancel anomalies. At least two SM singlets S; with different 
U(1)’ charges are required to given mass to all of the exotics, however. It is possible 
but nontrivial to ensure that enough of them acquire VEVs, and to avoid unwanted 
accidental global symmetries and the associated Goldstone bosons (Langacker et al., 
2009). 


Implications of a U (1y 


A Z' at the electroweak-TeV scale would have important implications for WNC, Z-pole, and 
LEP 2 experiments, as well as for direct searches at colliders, e.g., in pp or pp + Z’ > 4+4- 
by the Drell-Yan (s-channel) process. 

The low energy and LEP 2 experiments are influenced not only by Z — Z’ mixing, which 
modifies the Z couplings, but by Zə exchange and by the shift in the Z, mass from its 
SM value. The effective four-fermi interaction relevant to low energy processes in (8.79) is 
replaced (recalling that Jı = Jz/2) by 


4G 
HYG = -LYG = (vers Tt + 2wa + v3), (10.221) 


486 The Standard Model and Beyond 


where 
2 . 2 92 ‘ g2 
Peff = pı Cos’ 0 + p2 sinf 0 ~ p1, w= ae — p2) ~ ae 
1 1 
g 5 g ? 
y= (2) (pı sin? 0 + p2 cos? 0) ~ (2) p2- (10.222) 
gı gı 


In (10.222) pa = M$, /(M2 cos? 0w), and the second expressions are for small p2 and 0. In 
the same limit, the Zı mass shift in (10.206) results in 


2 
pı ~1+0°/p =1+ (2) C7 ps, (10.223) 
1 


where C is defined in (10.207). The modification of (10.222) relevant to LEP 2 is straight- 
forward. The Z-pole experiments are insensitive to Z2 exchange, but are very sensitive to 
Z — Z' mixing. This shifts the Zı mass downward according to (10.223), and therefore 
shifts the value of sin? yw obtained from Mz relative to the values from the asymmetries 
and other observables. The mixing also modifies the tree-level vector and axial couplings of 
the Zı to fermion w,. The couplings in (8.76) become 


gi — cos 0gb (r) + ra Og (r) ~ gy(r) + mcsaa) 


92 


[ (10.224) 
g'a —> cos Og4(r) + 7 sin Og4(r) ~ g4 (r) + aN 


These depend on the U(1)’ charges, so the oblique parameter formalism described in Section 
8.3.6 is usually not a good parametrization of Z’ effects. To the extent that the U(1)’ is 
a small tree-level perturbation it is usually sufficient to use the SM radiative corrections, 
though some care is needed in the definition of sin? Oy, e.g., by using the MS rather than 
the on-shell definition. 

The limits set by the WNC and LEP 2 on a Z’ mass M> were very model dependent, 
with typical sensitivities for the Eg models in the several hundred GeV to = 1 TeV range.®! 
The Z-pole experiments limited the mixing to |0| < few x107. 

The Mz limits have been superseded by direct seaches at the Tevatron and LHC, search- 
ing for resonances in pp or pp + ete~ or wtp (and also t*+7T~, dijets, tt, dibosons, etc) 
produced by the Drell-Yan process. Early Run 2 results by ATLAS (Aaboud et al., 2016) 
and CMS (Khachatryan et al., 2017) set lower limits of ~ 3 — 4 TeV on typical E¢-type 
bosons with electroweak coupling. These should be extended? to around 5-6 TeV later 
in Run 2. If a Z’ were observed, various diagnostic studies of its couplings, by forward- 
backward asymmetries, rapidity distributions, heavy quark decays, associated productions, 
and rare decays, would in principle be possible (e.g., Cveti¢ and Godfrey, 1995; Han et al., 
2013), but the statistical significance at Run 2 would be limited for masses much above the 
current exclusions. A later high luminosity phase at the LHC will extend the sensitivities 
somewhat, while a possible future 100 TeV pp collider would extend the typical reach to 
around 30 TeV (Arkani-Hamed et al., 2016). A possible future ete~ collider could also 
extend the LHC reach. Interference between s-channel Z and y amplitudes with that of a 
heavy virtual Z’ should be observable for masses well above the CM energy in cross sections, 


61These and the other results assume an electroweak scale g2, with the GUT-motivated value go ~ 
5/3g' = \/5/3g tan Ow often assumed. 
62These limits can be lower by as much as 1 TeV if the sparticle/exotic decay channels are open. 
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forward-backward and polarization asymmetries, etc. (e.g., Han et al., 2013; Arbey et al., 
2015). 

The observation of a TeV scale Z’, especially in the context of supersymmetry, could have 
a significance far beyond the Z’ itself. The possible physics implications include: (a) an ex- 
tended Higgs/neutralino sector, complicating and modifying the collider physics signatures 
and expanding the range of possibilities for neutralino (or other kinds of) cold dark matter; 
(b) the possible existence of heavy (Z’ scale) exotic particles, which may decay rapidly 
or be quasi-stable; (c) possible FCNC effects if the Z’ couplings are family non-universal 
(which often occurs in string constructions); (d) an enhanced possibility for electroweak 
baryogenesis; (e) a number of possibilities for exponential or power law suppressed Dirac or 
Majorana neutrino masses; (f) a possible efficient source for sparticle production; and (g) 
a possible connection to an otherwise hidden sector, either by kinetic mixing or by direct 
U(1)’ couplings (which often occurs in string constructions). The latter possibility could 
lead to Z’ mediation of supersymmetry breaking by a Z’ — Z’ mass difference. All of these 
possibilities are reviewed in (Langacker, 2009). 


Very Light Z's 


We finally mention the possibility of very light Z’ gauge bosons (often referred to as dark 
photons or as U bosons) with extremely small couplings to ordinary matter. A common 
scenario is that the new boson connects to ordinary matter only via kinetic mixing with 
the photon. Major motivations are that a dark photon could account for the discrepancy in 
the muon magnetic moment (Section 2.12.3) and/or that it could be a portal that mediates 
weak interactions between a dark sector (for the cold dark matter) and the SM particles. 
Typical masses considered are in the MeV-GeV range, with a kinetic mixing parameter y 
of O(107? — 1077), but much wider mass ranges and much smaller mixing are possible. 
Numerous constraints, e.g., from beam dump experiments, rare K and hyperon decays, 
fractional charges, QED, astrophysics, and cosmology, are reviewed in (Pospelov, 2009; 
Jaeckel and Ringwald, 2010; Essig et al., 2013; Alexander et al., 2016). 


10.3.2 SU(2)z x SU(2)r x U(1) Models 


The simplest SM extension involving an additional charged gauge boson is the SU(2)z x 
SU(2)r x U(1) model (Pati and Salam, 1974; Mohapatra and Pati, 1975; Senjanovic and 
Mohapatra, 1975), in which the first (second) SU(2) couples to L (R)-chiral fermions. A 
left-right (LR) interchange symmetry between the factors is usually imposed, so that space 
reflection and charge conjugation invariance hold at the Lagrangian level even though the 
theory is chiral. The original version of the model, in which the extended gauge and LR 
symmetries are broken at the TeV scale, is disfavored by gauge unification and neutrino 
mass constraints. Most modern treatments assume that one or both symmetries are bro- 
ken at a large scale, and they are usually considered in the context of a subgroup of the 
supersymmetric SO(10) grand unified theory. The model has also been revived (Agashe 
et al., 2003) in the context of warped extra-dimensional models to provide a custodial 
symmetry to protect the pọ parameter. Here, we will give a short overview of the original 
non-supersymmetric TeV scale model as useful background material. Modern developments 
are reviewed, e.g., in (Mohapatra, 2003; Maiezza et al., 2010; Cao et al., 2012; Patrignani, 
2016). 
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The SU(2), x SU(2)r x U(1) pr Fields and Gauge Interactions 


We assume that the fermion fields 


0 0 0 0 
0 Um 0 Um 0 Vm 0 Uy, 
mL = ( 0 ) ’ mR 7 ( 0 ) ’ lnii = ( 0 ) ’ Cnr = ( 0 ) (10.225) 
dm J; din R EmA T Em Su 


transform under SU(2)z x SU(2)r x U(1)gL as 


Ganis Gick G35, Ciao; Aa, (10.226) 
6 6 2 2 

generalizing (8.4). The SU (2); and SU(2)p gauge bosons W;"",,7 = 1, 2, 3, therefore couple 

to V — A and V + A currents, respectively. The U(1)sz gauge boson W$, couples to the 

charge Tp, = (B — L)/2 that was defined in (10.216). The gauge couplings gz = 9, gr, 

and ggz are related to the g’ of the SM and to e by (10.218). The fermion covariant kinetic 
energies are 

Ly = Gi Pa? + aRi Pdh + Lipt, + CRIDER, (10.227) 


where the sum over families is implied. Ly includes the fermion gauge interactions 


T. wt 4 EBL 4 T. wet OBL 
Lpr= Gu fo: a + ama a E a 
Th wt = T wet a 
— Th fo: a e dae gR- 3o 3 Whr| th (10.228) 


A Higgs multiplet transforming as (2,2*,0) is needed to generate fermion masses and 
eventually break the SU (2) x U (1) symmetry. We introduce ® and its tilde form ®, 


of n) : . e. a 
® ( és go}? P = n*m -o 9" (10.229) 
in analogy to (8.14). Like most extensions of the SM, there are two separate SU(2) doublets. 
Unlike the MSSM, both neutral Higgs fields will have Yukawa couplings to both u and d 
when one includes the tilde couplings, implying Higgs mediated FCNC at some level. & 
transforms as er Daa 

® > ' =U, (8,)OUR(Bp)t = etft pe i r a, (10.230) 


and similarly for ®, and the corresponding gauge covariant derivative is 


wh 7-Wk 
D'd = "G4 (a? ; L® — gab 5 £). (10.231) 
The ® kinetic energy is 
Lo =Tr CZ (D"®)| (10.232) 


and the potential can be constructed in an obvious way from the invariants Tr (O'6,) and 
Tr (O10,6! d), where ®; jk, can be ® or ®. We assume that the neutral components of ® 


acquire VEVs, 
k 0 ~ K* 0 
(6) = ( ay aad ) , (6) = ( 0 at ) (10.233) 
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At least one additional Higgs field with tg, #4 0 is needed to break the electroweak 
symmetry to U(1)g, and it should be an SU(2) singlet with a neutral component to ensure 
Mw, > Mw,. There are a number of possibilities, but two are usually considered. 

In the doublet model, one introduces the pair of doublets 


ot of 
br ={ n ). n= ( 3 ). (10.234) 
L R 
1 


which transform as (2,1,5) and (1,2, $) respectively. The dz, is not really essential for 
SU (2), x SU(2) rx U(1), but is introduced to allow the possibility of a left-right interchange 
symmetry. We assume that the potential for ®,d; R, which we do not display, is such 
that |usr| = |(%)| > v/W2, where the electroweak scale v ~ 246 GeV is given by v? = 
2(\vsz|? + |x|? + |K’|?), with |vsr| = |(52)|. In that case SU(2); x SU(2)r x U(1) is broken 
to SU(2)z x U(1)y at scale |vsr|. The dz,r do not contribute to fermion masses. 

The other common model employs Higgs triplets transforming as (3, 1,1) and (1,3, 1), 


Ait Akt 
Ar=| A} l], Ar=| A} ts (10.235) 
At. AR 


One assumes that |var| = (A9) > v/V2 = (|k|? + |K'|?)!/ and that |var| = |(A?)| « 
v/v/2. The second condition is required by the pọ parameter in (8.182) on page 317 and 
Problem 8.1. The triplet model is popular because Ap can yield a |va grļ-scale Majorana mass 
term, ~ hARVÄRDEVR/V2+ h.c., for the right-handed neutrino (Mohapatra and Senjanovic, 
1980), similar to (9.107) on page 416. For Dirac masses comparable to the charged leptons 
and ha gvar = O(1 TeV) this would yield Ve u,- seesaw masses of order eV, keV, and MeV, 
respectively. These values were quite acceptable when the model was proposed but are now 
excluded. However, the charged lepton and neutrino Dirac masses are in principle indepen- 
dent (see Equation 10.245 below), so one could choose the couplings so that |var| and the 
heavy Majorana mass are at the TeV scale and the ve, u,- masses are in the experimentally- 
allowed range, i.e., the low-scale seesaw. Alternatively, the model can be reinterpreted as a 
GUT or high-scale model, probably embedded in SO(10). We also mention that a VEV for 
Az would yield a direct type II seesaw Majorana mass term ~ ha, vA, VL veJ 2+ h.c. The 
models also have the interesting feature of doubly charged Higgs fields Af h (Accomando 
et al., 2006). 


The Gauge Bosons 


The charged and neutral gauge boson mass matrices are obtained by a calculation similar 
to Problem 8.1. For the charged bosons, 
2 


Lw = (MẸ) „Wa Wy =X MPwiw;, (10.236) 
i=l 
where a,b = L, R and 
M2 M2? erty 
M, = ( = IR I 10.237 
w=\ Mpe M3 oon 


The elements are expressed in terms of the Higgs VEVs as 


I 1 
Mir = JIL,R (|? oe Kl? F seal + 2\va, zl”) = JIL RIL,R 
10.238) 
- 


Miz = —grgri|i'n*|, e = |e’ *]’ 
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where the mixing term is in general complex. The mass eigenstate fields are 


wi \_ cos e7™ sinÇ Wr 
( we ) E ( —sin e~“cos¢ ) ( we i ee) 


where the mixing angle ¢ and phase w are 


tan 26 = F = a es iy (10.240) 
an =T e” =se. š 
M? — Mr’ 


The + signs represent alternative phase conventions for Wz. The mass eigenvalues M?» 
are given by a formula analogous to (10.204). In the limit ng > nz, 


1 
momin lan, M~ Ma~ Leb 
10.241 
_ „IL lg | IR¢| < ee ( ) 
gr |nr}? gp |” \gL/ M3 


The neutral gauge boson sector is the same as for the LR model in Section 10.3.1. The 
mass-squared matrix is given by (10.201), with 


1 1 i 
M% = 3927 M} =h tob) Te A=- a (KHP), (10.242) 


where ñ% p are the same as nz p defined in (10.238) except 2|vA; r|? > 4|va, rl? The Z’ 
charges and a are defined in (10.219). For nr > nz, 


M2 1 _ {2 4 2 
anaes onl + Aeon (10.243) 
Mw, a? j vsr? + 2|vanl? 


The Yukawa Couplings 


The ® and Yukawa couplings to the quarks and leptons are 


-Lyw = > i (tan + Smn®) ety Be (tn + mn) pthc. (10.244) 


mn 


There may be additional couplings of the A; rz to the leptons, generating Majorana neutrino 
masses. The ® and ® Yukawa matrices are independent, allowing nontrivial quark and lepton 
mixings in the charged current interactions.°*? This leads to the quark mass matrices 


M” =rk+sK”, M’ = re! + sk*, (10.245) 


and similarly for the leptons. These can be diagonalized just as in the SM in Section 8.2.2, 
yielding a hadronic charged current interaction that generalizes (8.55), 


1 
ch = gou [ILVE WET Pr + gRVEWĘ' Pr] d+ h.c. (10.246) 
in terms of the mass eigenstate quark fields. Ve = Aut Ad is just the CKM matrix, while 


VË = Ay AT is the analogous mixing matrix for the SU (2)r currents, which (unlike in 


63Supersymmetric and SO(10) extensions require a second independent © field. 
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the SM) is observable. A similar expression holds for the leptonic currents, except that the 
right-handed current is absent in the low energy effective theory if the vg acquires a large 
Majorana mass from Ap. In terms of the mass eigenstate gauge fields, 


L}, = uy gt V4" Pr + grtance™ Vii PR d wi 
Wo y2 {an| (10.247) 
+ Uyu [-—91 tan ¢ VŽ Pr + gre” V? Prd wy" + h.c. 
The effective four-fermi interaction at low energies in (8.58) is replaced by 


4G p 
Hef = Va 


where the apparent (measured) Fermi constant is 


b d 
HJE + Jih + Shy St + Thy Tal > (10.248) 


Gr gicos*Ca 


= 10.249 
v2 8M? ( ) 

The V F A charge raising currents are 
I ay = OVE Prnt ton Pras (10.250) 


with the usual caveat about vr. The SM current Tis in (8.60) on page 272 is OF} The 
coefficients in Hey are 


a=1+Btan2¢, b*=c=e te tan (1-8) 
ae Me (10.251) 
— [IR 2 
d= (=) (tan ¢ + 8), ampe pe 
Left-Right Symmetry 


Our discussion of SU (2)z x SU (2)r x U(1) has so far been quite general. However, most 
studies invoke a left-right interchange symmetry, under 


wi > Wh, Wet > Wet, oF > Ons Ve > ae (10 252) 
® & GT, ôL © OR, Ar S Arp. 
The Lagrangian density is invariant under LR for 
gr=gr, r=r', s=si, t=tl, u=ul, har=har, (10.253) 


as well as appropriate constraints on the scalar potential. In this case, there is a formal P and 
C invariance at the Lagrangian level. However, the invariance is of a different character than 
for SU(3). or U(1)g because the gauge symmetries are still chiral under SU (2)z x SU(2)r 

The gauge symmetry breaking also breaks the LR symmetry. In particular, even though r 
and s are assumed to be Hermitian, M“ = rk+sx’* is not Hermitian if k and K’ are complex, 


64A discrete LR symmetry broken at the TeV level would lead to problems with cosmological domain 
walls, similar to Section 3.3.2, would not allow the development of a baryon asymmetry while C is conserved, 
and is not consistent with gauge coupling unification, giving more reasons why current studies generally 
assume that the gauge symmetry and/or the LR symmetry is broken at a high scale. 
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implying Ve x Ve However, many studies assume either: (a) manifest LR symmetry, i.e., 
r and s are complex (so that CP is explicitly broken) but « and «’ are real (or at least 
their phases are small). If they are real, ve = V (b) Pseudo-manifest LR symmetry, i.e., 
r and s and the other Yukawa matrices are real and symmetric, but « and «’ are complex 
(spontaneous CP breaking). The mass matrices are therefore complex-symmetric, so that 
Aj} = Ak K", where K” is a diagonal phase matrix, and similarly for the other fermions. 
Unlike the somewhat similar Majorana neutrino case (Equation 9.33), it is more convenient 
to choose the phases in AÙ? to put VÝ in the canonical CKM form. Then kK“ must be 
chosen to make the mass eigenvalues real and positive. The result is that the elements of 
ae and Ve may differ in phase, but they have the same magnitude, Vin nl= vV 


Experimental Constraints 


There are many modifications to the SM physics due to the Wr. The mass of the W ~ W, 
boson is lowered by the mixing effects (corrections to (10.241)). That and Z — Z’ mixing 
can modify the predicted Mw /Mz ratio and the relation of the masses to other precision 
electroweak observables. Low energy observables are affected both by Wz — Wr mixing and 
by Wer exchange, which lead to an admixture of V + A in the hadronic interactions. A 
corresponding V + A component to the leptonic current would only be present for a Dirac 
neutrino or a very light Majorana vg. Such effects have been searched for extensively in p 
and 8 decay, in weak universality tests, in new box diagram contributions to the Kz — Ks 
mass difference, as mentioned in Chapters 7 and 8, and in direct searches for a W’ or (for 
the case of a TeV-scale seesaw) heavy Majorana vp at the Tevatron and LHC (e.g., Maiezza 
et al., 2010; Cao et al., 2012; Patrignani, 2016). No definitive evidence has been observed 
for such effects, but the extracted limits on M2 and ¢ are dependent on the assumptions 
concerning gr/gt, eae w, and the nature of vr. Assuming manifest or pseudo-manifest 
left-right symmetry there is a stringent limit Mə 2 2.5 TeV from the contribution of the 
box diagram with one Wz and one Wr to Amg defined in (8.236) (Beall et al., 1982; 
Zhang et al., 2007). However, somewhat lower Mz would be possible more generally (even 
with gr = gL) (Langacker and Uma Sankar, 1989). The limits on |¢| are stringent even for 
general VŽ. For example, universality implies |Ç| £ O(0.0004) for gr = gz and w = 0 (for 
heavy vp), but the constraint is weaker for w # 0 (Problem 10.17). 

The low-scale seesaw versions of the model could also yield new contributions (Maiezza 
et al., 2010; Vergados et al., 2012; Faessler et al., 2014) to 660, that could be comparable to 
the ordinary light-neutrino exchange diagram in Figure 9.2 on page 376. The most important 
are the analogous diagram in which vz is replaced by the heavy vp and W- by W} (the 
mixing effects are small), and one involving (At)-— 

The direct production limits on W’ and vp depend on their relative masses, but generally 
extend into the several TeV range. W’ could decay into qq, £vp, or possibly W Z. A Majorana 
vr could subsequently decay to fgg (i.e., W” > vr —> llqq) via a virtual W’, where the 
two leptons could be same sign or opposite sign with equal probability. 


10.4 GRAND UNIFIED THEORIES (GUTS) 


The gauge problem described in Section 10.1 suggests the possibility of grand unifica- 
tion (Pati and Salam, 1974; Georgi and Glashow, 1974), in which the SM gauge group 
SU(3) x SU(2) x U(1) is embedded in a simple group G, with the quarks and leptons com- 
bined in the same multiplets. The Pati-Salam model achieved a partial unification linking 
quarks and leptons, based on the group SU (4)e x SU(2)r x SU(2)r. The electroweak part 
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incorporates the left-right symmetric SU(2), x SU(2)r x U(1)pr model, with the Tg, 
generator embedded in SU(4).. The color SU(4),. group extends QCD to include a fourth 
“color,” identified with lepton number, so that the fundamental representations for the first 
family are (uR, UG, UB, Ve) and (dr, da,dpg,e). The SU(4). symmetry would have to be 
spontaneously broken to SU(3). x U(1)sr at a sufficiently high scale. Alternative versions 
of the model involved integer charged quarks and extended electroweak groups. 

The first full unification of SU(3) x SU(2) x U(1) into a simple group was the Georgi- 
Glashow SU(5) model (extensively developed in Georgi et al., 1974; Buras et al., 1978), in 
which the strong, weak, and electromagnetic interactions are unified above the scale Mx > 
Mz of SU(5) breaking. Thus, the properly normalized SM gauge couplings extrapolated 
from low energy should meet at Mx. At the time the model was written the gauge unification 
worked reasonably well, suggesting Mx ~ 10'4~!° GeV. As mentioned in Section 10.2.6 the 
unification (using the more precise couplings determined subsequently) works much better 
in the supersymmetric extension of the SM, yielding My ~ 1016 GeV. 

The left chiral q, g°, £ and ° are unified in the same multiplets in SU(5) (though each 
family is still reducible). This implies electric charge quantization since there are no U(1) 
factors. It also predicts proton (and bound neutron) decay mediated by the new gauge 
bosons that connect quarks with leptons and quarks with antiquarks, by the diagrams in 
Figure 10.9. One expects decays into modes such as p + e+ 7°, with a lifetime 


Mx 


29 __. 
am Mx~10! ee (10.254) 


Tp ~ 


More detailed estimates allowed a somewhat longer lifetime, but nevertheless the original 
model was eventually excluded by searches at SuperKamiokande and elsewhere.” The gauge 
mediated lifetime is much longer in the supersymmetric version, ~ 1038 yr, but such models 
lead to faster decays into modes such as PK* by processes involving sparticles. The simplest 
forms of SU(5) also make predictions relating the quark and lepton Yukawa couplings, which 
are only partially successful. 

In this section we will describe the structure and some of the implications of the non- 
supersymmetric SU(5) model, and briefly comment on some extensions. Much more detailed 
discussions can be found in a number of reviews (Langacker, 1981; Ross, 1985; Hewett and 
Rizzo, 1989; Mohapatra, 2003; Patrignani, 2016). 


10.4.1 The SU(5) Model 
The Fields 


SU(5) is a rank 4 group (like the SM), with fundamental 5 x 5 representation matrices 
L? = )'/2, i = 1- -- 24, which generalize the SU(3) matrices in Table 3.1 in an obvious way. 
The upper left 3 x 3 block corresponds to the SU(3) subgroup (with indices a,b = 1,2,3 
denoted by a and 8), while the lower right 2 x 2 block (with a,b = 4,5 denoted by r and 


s) corresponds to SU(2). The normalized hypercharge generator Yı = yY is diagonal, 


is gs 5 1 

ap lias(-2 -2 3 3), THEY, = 5, (10.255) 
and commutes with SU(3) and SU(2). SU(5) contains 12 additional generators not con- 
tained in SU(3) x SU(2) x U(1), with nonzero values for Li, or L'.,. We will use the tensor 


Ly, = 


65The current lower limit on the partial lifetime into et 7° is 1.7 x 1034 yr (Takhistov, 2016). For reviews, 
see (Nath and Fileviez Perez, 2007; Babu et al., 2013). 
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Figure 10.9 Diagrams leading to proton decay in the SU(5) model. X and Y are 
color antitriplet gauge bosons with electric charges 4/3 and 1/3, respectively. 


notation introduced in (3.115) on page 111 for the fields, with a fundamental 5 written with 
a lower index, such as ®,, and an anti-fundamental 5* as Y°. 
The 24 adjoint gauge fields A* decompose under SU(3) x SU(2) x U(1) as 


24 — (8, 1,0) + (1,3,0) + (1,1,0) + (3,2",-3) + (3*,2, +2), (10.256) 
eee 
GÊ, w=,we B AT Aa 


where the third number is hypercharge, y = q — t’. This can be written as a 5 x 5 matrix 
24 xi 

A= A 10.257 

2 ee 


It is sometimes convenient to define the fields A? = Aa», These are non-Hermitian for 
a # b, with AÈ = (A$). The diagonal elements are not independent and are constrained 
by A£ = 0. Displaying the entries explicitly, 


ene «¢ G3 x, A 
G3 G3- 45 G3 Xə Yo 
A= G3 Gi G3-72 | Xs Ys , (10.258) 
x x? xe | We + 3B wt 
yt Y? Y? w- We SB 


where W+} = (W! = iW?)/V/2, W? = W°, and B are the SU(2) x U(1) bosons, and 
C=. G'A g/V2 are the gluon fields with GX = 0. There are 12 additional gauge 
bosons A® and A’, which carry both SU(2) and color indices. These can be written 

A? = X” [3*,2, qx = 


(10.259) 
AS =i" [3*, 2, qy = 
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where the SU(3) and SU(2) representations and the electric charge are shown. 
Each fermion family of left-chiral fields is assigned to a reducible 5* + 10 representation, 
where the 5* is anti-fundamental and the 10 is the antisymmetric product of two 5’s: 


"a E (10.260) 
10, => (3*,1,—)+ (3,2, =) + (1,1,1), 
~ 3 6 TU 
Pab=—Vba = i aa was 
aß ar 


where a,b = 1- -- 5; a, 8 = 1,2,3; and r = 4,5. We emphasize that the x and w are L-chiral, 
but we omit the subscript L to simplify the notation. The field content can be written 
explicitly as 


y’ = (a de? dæ eT =i 


0 u —u? —U1 —dy 
—uS 0  uř |-u -dz (10.261) 
w — co c2 _ 47k 0 = —d ` 
ab = V2 u u U3 3 ; 
ur uz u3 0  —et 


L 


where the family indices and weak-eigenstate superscript ° are suppressed. The lepton dou- 


blet ( w = ir? = ) in y transforms as a 2*, and the 1/2 in y is introduced to 


e Jr L 
avoid double counting. The antisymmetric part of 3 x 3 in SU(3) is a 3*, so the upper 3 x 3 


block of Y is Yag = €agyuy / V2. Similarly, the antisymmetric part of 2 x 2 in SU(2) isa 
singlet, Vs4 = —Wa5 = e} /V2. The fermion family multiplet can be extended to 5*+10+1, 
where the singlet is the vf. 

We also introduce a real adjoint Higgs multiplet ®, and a fundamental complex 5-plet 
H that contains the SM Higgs doublet (6+ = Hy and ¢° = Hs), 


ow H,= i (10.262) 
j=] o? 


The © fields have the same quantum numbers as the gauge fields in (10.258), and are 
+ 

introduced to break SU(5) down to SU(3) x SU(2) x U(1). The Higgs doublet ( “0 ) 

transforms as (1,2, 3), while its SU(5) partner Ha = (3,1,—§) is a color triplet with 

qu = —ż. 

The Fermion Gauge Interactions 


The gauge covariant derivatives for the fermion fields are 


(Dux)* = Ox —i EA 


v2 


(Duth)ay = pab + i Fa (Anat + i Fo (Ariat 


(10.263) 
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where gs is the SU(5) gauge coupling and we have used Li. = —Li”. The fermion gauge 
interactions are therefore 
Ly = Xai (PX) + Bi Dh) = LF" + LF", (10.264) 


where the fields are weak eigenstates and a sum over families is implied. £?™” includes the 
kinetic energy terms and the SM fermion gauge interactions in (5.2) and (8.7), with the 
restriction that the gauge couplings are unified, 


5 
93 = 92 =91 =95, where g3 = gs, g2=9, N= fie. (10.265) 
L% Y contains the leptoquark and diquark interactions of the X and Y, 


LEY =~ ® [eb E K die — 0% a 


v2 


leptoquark vertices (10 266) 
g5 = Ç -c ọ ` 
— Va Gan Kaug + eeyan Yadrel +h.c. 


aaan aaas 


diquark vertices 


These expressions can be transformed using the identities in (2.301) on page 56 and (2.302). 
For example, 


Ek K°dra=—di, K'e, Pa, Kaung = uh C Kauirg. (10.267) 


The transitions can be shown schematically (suppressing color) as 


X,Y X,Y 
—_— —_ << 
aE YL c + UL c Æ 
W I( = a) («i a ui) {w : 
5* 10 


Since the X and Y act both as leptoquarks and diquarks, both baryon and lepton 
number are violated (B — L is conserved), leading to proton and bound neutron de- 
cay by the diagrams in Figure 10.9. Prominent modes predicted by SU(5) included 
p > etn? eto, etw, etn,etnrtnr prt, ppt, and prr’. Bound neutrons can also de- 


M$ 2 
n zo = 210) = my di ; X,Y — fE 
cay into y7°, 7p°, Dw, vn,etm , etp, etc. One expects Tn ~ amg? where a5 = 7 (cf. 


5 


the muon lifetime 7, ~ My,/g*m?,). More detailed early estimates predicted 7, ~ 10°!*? yr, 
with the lower end of the range favored by current input parameters. The SU(5) prediction 
of proton decay motivated a number of sensitive searches involving large underground de- 
tectors. These eventually excluded most of the predicted range of lifetimes, i.e., they require 
Mx y 2 10'° GeV, somewhat larger than expected from the gauge couplings. Proton decay 
continues to be a stringent constraint on more modern versions of grand unification, which 
usually predict longer lifetimes. 


Spontaneous Symmetry Breaking 


The adjoint Higgs ® introduced in (10.262) has a potential (Buras et al., 1978) 


V(@) = ET (0?) + [re (oy)? + ET (0t) ET), (10.268) 
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with Tr (©?) = 676°, etc. Invariance under 6 + —® is often imposed, implying c = 0. For 
u? <0 one has (0|®|0) 4 0, and one can always take (0|®|0) to be diagonal by performing 
an SU(5) transformation. One can show (Li, 1974) that for b > 0,c = 0 the minimum is 


at&® 
3 3 = 2p? 
(®) = diag(ve vo vo — Zro — zve) with 13 =i T 


(Vacuum stability requires a > —7b/15, so vj is positive.) Therefore, SU (5) is broken to 
SU (3) x SU (2) x U(1) at the GUT scale 


(10.269) 


25 
M2 = MŽ = g Va (10.270) 


Twelve of the scalars in ® are eaten by the X and Y. The others attain masses of O(|,J), 
but are of little phenomenological importance because they do not couple to fermions. 

To further break SU (3) x SU (2) x U(1) down to SU (3) x U(1)g one must utilize the 
5-plet H, in (10.262). The potential in (10.268) for ® is extended to 


2 b 2 
V (8, H) = E Tr (0?) 4 $ [Tr (82)] + -Tr (0!) + 5 Tr (8°) + SHH 


+ à (HH)? +aH'H Tr (3?) + PHO? H + SHTOH, (10.271) 


where the SU (5) indices are contracted in an obvious way, and ® — —® invariance would 
imply c = ô = 0. Two severe fine-tuning problems immediately arise: (a) One needs to have 
(¢®?) = J ~ 10713vs, where v ~ 246 GeV is the weak scale (the GUT hierarchy problem). 


(b) The color triplet mass Mj, must be = 10!4 GeV = 10!?Mọ to avoid too rapid proton 
decay mediated by Ha (the doublet-triplet splitting problem). (Note that the 8 and 6 terms 
are the only ones in V(®, H) that can split the doublet mass from the triplet.) 

Both of these enormous hierarchies must emerge from the parameters in V(®, H). If 
the dimensional parameters are all of O(Mx) and the dimensionless ones of O(1) then it 
is natural for Mz, to be comparable to ve. One must then fine-tune the parameters to one 
part in v/v? ~ 1074 to obtain sufficiently small v and Mọ. The fine-tuning problem is 
actually even worse, because it must apply to the entire effective potential. That is, one 
must “retune” the parameters at each order of perturbation theory to avoid destabilizing 
the hierarchy.°" 


Yukawa Couplings and Fermion Masses 
The SU(5) symmetry does not allow couplings of ® to the fermions, but allows 


Ly uk =Ymn xeTe Wab Ht 


10.272 
+Imn COO YT p C Ynca He ie Ca + h.c., ( ) 


where m and n are family indices, I is symmetric, € is antisymmetric with «!2°4° = 1, and 
the weak eigenstate superscripts ° have been suppressed. The fermion masses are generated 


°6The minimum is of the form diag(v4, vi, Vh v4, — 4v4,) for b < 0,c = 0, implying that SU(5) > 
SU(4) x U(1). 

67A small parameter is said to be technically natural if a new global symmetry emerges when it is 
set to zero, such as the chiral symmetry that is introduced when a fermion mass vanishes. That ensures 
that only one fine-tuning is needed, because renormalization effects are proportional to the original small 
parameter (’t Hooft, 1980). Unfortunately, there is no such new symmetry in the present case. 
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from (Ha) = 6°-%. The first term yields 


V2 
Lae = —d, M"dp — ef Mt ef + hic. = —diyM "dp — €pM°er + h.c., (10.273) 
where 1 
M’ = M = uk (10.274) 


That is, the SU(5) symmetry predicts that the d quark and charged lepton mass matrices 
are the same up to a transpose,’ because they derive from the same coupling. Equation 
(10.274) implies that the eigenvalues are equal at Mx, i.e., 


Ma = Me; Ms = Mp, mp = Mz. (10.275) 


This appears to be a disaster. However, the situation is partially remedied because the 
Yukawa couplings are running quantities, mainly from the gauge loops. The largest effects 
are from the gluons, which make the quark masses larger than the leptons’ at low energies. 
To lowest order (Buras et al., 1978) 


non e ea a lacy e [easy] 4927 


0 


where F is the number of families. This implies mẹ ~ 5 GeV for m, ~ 1.7 GeV and 
F = 3, consistent with the observed value. However, (10.276) fails for the light families. To 
avoid uncertainties in the overall scale of the light quark masses, one usually considers the 
predicted ratio, 

Ue (10.277) 


Me Ma 


Equation (10.277) fails by an order of magnitude, with m,/me ~ 200 and ms/ma ~ 20 
from (5.122) on page 193. This suggests a more complicated Higgs structure, such as the 
introduction of an additional Higgs 45-plet with family symmetries to restrict the form of 
the Yukawa matrices (Georgi and Jarlskog, 1979). The second and third terms in (10.272) 
lead to the u quark and Dirac neutrino mass matrices 

Av 


V 
M* = M*? = ŻT M% = —=ri. (10.278) 
V2 ’ D V2 


Gauge Unification 


We have seen in (10.265) that the properly normalized SU (3) x SU (2)xU (1) gauge couplings 
should all be equal at Mx, implying that the value of the running weak angle is predicted 
at Mx, 

g? 35 3 
g2? +g? 143/5 8’ 


32, (Mz) = (10.279) 
where we use the MS angle defined in (8.139) on page 299. To compare with experiment 
the couplings must be run to low energy (Georgi et al., 1974), using (10.195). The SM 
coefficients are given in (10.196), but it instructive to follow the historical approximation, 


68The transpose is exploited in the lopsided neutrino mass models mentioned in Section 9.5 where a 
nonsymmetric y induces large mixings in Af and the unobservable Ag, The mechanism is more difficult to 
implement in SO(10), because the simplest analog involving Higgs 10’s is symmetric. 
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using the expressions in (8.198) for an arbitrary number F of families but ignoring the 
(small) contribution of the Higgs, 


F 1 [22 4F 1 4F 
LO Yan??? 167? É 3 | > O 6r | 3 | noes) 
These can be solved to yield 
a(M2) 3 lla(M2), M2 oy, 1 5 a(M2) 
==)1 l Mj =p A 10.281 
as(M2) 8 2m M2 |’ 82(Mz) = & + 9 as (M2) peer 


independent of F. (Corrections do depend on F.) Plugging in a,(M?) ~ 0.12 and 
a( M2) ~ 1/128 yields &% (M3) ~ 0.20 and Mx ~ 10!° GeV, which is a reasonable zeroth 
order prediction. However, as already discussed in Section 10.2.6, more precise two-loop cal- 
culations with the current inputs do not quite work, as can be seen in Figure 10.8. Starting 
with a and as, one finds My ~ 10 GeV and 8%(M#) ~ 0.21, which is too low. Taking 
the modern view of using a and 8%(M#) as inputs yields a very low a;(M#) ~ 0.07 and 
Mx ~ 108 GeV, both of which are unacceptable. 


Cosmology 


The SU(5) model apparently offered an elegant explanation for the observed baryon asym- 
metry of the Universe (Section 10.1), i.e., the out of equilibrium decays of the superheavy 
partners Ha of the Higgs bosons (Equation 10.262) (Yoshimura, 1978). Although the life- 
times of the Ha and their conjugates HS, must be equal by CPT, the rates into specific 
corresponding modes could differ by C and CP violating effects (a partial rate asymme- 
try (Okubo, 1958)), e.g., 


TH gd) ATH? > aq), TH 44) ATH > ol’). (10.282) 


Combined with the intrinsic B violation and the nonequilibrium (i.e., the decays occurred 
at T < My) all of the Sakharov ingredients to generate a baryon asymmetry were ful- 
filled and an adequate asymmetry (with B — L = 0) could be generated (though only in 
non-minimal models). Unfortunately, it was later recognized that the symmetry would be 
subsequently erased by nonperturbative sphaleron effects. The idea could be resurrected 
in more complicated models in which a nonzero B — L was produced in the decays, the 
most plausible being leptogenesis.°? Other possibilities for the baryon asymmetry, such as 
electroweak baryogenesis, do not involve out of equilibrium decays. 

A serious difficulty for grand unification involved magnetic monopoles, which are topo- 
logically stable gauge/Higgs configurations present when a simple or semi-simple group is 
broken to a group including a U(1) (Coleman, 1985). One expects a superheavy monopole 
mass Mm ~ Mx /a, and most likely an efficient production during the phase transition re- 
sulting in the symmetry breaking. Although MM pairs would annihilate efficiently, enough 
would be left over to greatly overclose the universe (Preskill, 1979; Kolb and Turner, 1990). 
Suggestions for solving this monopole problem included the possibility that electric charge 
was not conserved at intermediate temperatures, leading to rapid annihilations (Langacker 
and Pi, 1980). However, much more compelling was the suggestion that a subsequent period 
of inflation would dilute the density to negligible values, as well as resolving other cosmo- 
logical problems (Guth, 1981; Kolb and Turner, 1990; Lyth and Riotto, 1999; Linde, 2008). 
Magnetic monopoles and their experimental constraints are reviewed in a general context 
in (Patrizii and Spurio, 2015). 


69 Another difficulty with GUT baryogenesis was that any asymmetry would be erased by a subsequent 
period of inflation. This continues to be a constraint on leptogenesis models. 
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10.4.2 Beyond the Minimal SU(5) Model 
Supersymmetric Grand Unification 


It is straightforward to extend the SU(5) model to supersymmetry (Dimopoulos and Georgi, 
1981; Raby, 2009). Just as in the MSSM, one must add an additional Higgs multiplet 
transforming as a 5*. The Higgs part of the superpotential becomes 


Wen = peTr (8°) + Ae Tr (8°) + pe Hy Ha + Aon HaHa, (10.283) 
where 
Ha Ho 
Hua= | At |, i=) W J, (10.284) 
h? ha 


® is defined as in (10.262), and we are using the same symbols for the superfields and their 
scalar components. The Yukawa couplings to the matter supermultiplets are analogous to 
(10.272), with H > Hu and Ht > Ha. 

As has already been emphasized in Section 10.2.6 and Figure 10.8, the gauge unifica- 
tion is much more successful when the couplings are extrapolated using the MSSM £ func- 
tions (Dimopoulos et al., 1981), with the couplings approximately meeting at Mx ~ 3x 101° 
GeV. Using the observed @(M?), 8?,(M?), and a,(Mz2) as inputs, the two-loop unification 
works precisely for a common scale Msygy ~ 5 TeV for all of the new sparticles and the 
second Higgs doublet. However, there could be additional threshold corrections at the TeV 
scale due to splitting between the masses of the new particles,” as well as ones due to mul- 
tiplet splitting or higher-dimensional operators at the GUT scale (Langacker and Polonsky, 
1993, 1995; Carena et al., 1993; Bagger et al., 1995). The m,(Mx) = m,(Mx) prediction 
in the supersymmetric case works well for tan 8 = v,/vq ~ 1 or 40 — 50 (e.g., Barger et al., 
1993; Langacker and Polonsky, 1994; Carena et al., 1994), with the larger value consistent 
with a Higgs mass of 125 GeV. 

Because of the large value of Mx expected in the supersymmetric case, the contribu- 
tion of dimension-6 operators such as wude/M% (from X and Y exchange) are greatly 
suppressed, leading to proton lifetimes of O(10°°) years. However, there are dangerous new 
dimension-5 operators (Weinberg, 1982; Sakai and Yanagida, 1982; Dimopoulos et al., 1982; 
Ellis et al., 1982) such as usdv/Mx or Cdue/Mx. These are generated by the exchange of 
the fermionic partners of the H and H° from the Higgs multiplets Hu,a, which have the 
same Yukawa couplings as those that generate the fermion masses. The operators must 
involve a second or third family because of the commuting nature of the superfields. They 
may be dressed by the exchange of a gluino, neutralino, or chargino, to produce a proton 
decay operator such as usdv/Myx. Since there is only one inverse power of the large mass 
scale, the proton lifetime Tp ~ m3/M, which is dominantly into modes such as p > K+ 
or n + 0K°, is relatively short even after taking into account the small Yukawa couplings 
and additional couplings and loop factors from the dressing. The SuperKamiokande limit, 
T(p > VK*) > 6.6x10°8 yr (Takhistov, 2016), excludes the minimal supersymmetric SU (5) 
and SO(10) models, while versions with non-minimal Higgs sectors predict lifetimes within 
a factor of a few of the current limits (see, e.g., the Grand Unification review in Patrignani, 
2016). Of course, one must also exclude proton decay from squark exchange with R, vio- 
lating couplings, just as in the MSSM. These are known as dimension-4 operators because 
they are not suppressed by any powers of the GUT scale. 


“0The low-scale threshold effects tended to be of the wrong sign for most breaking/mediation schemes 
(in which the colored sparticles are usually heavier) for the now excluded possibility that the masses were 
in the several hundred GeV range. 
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The SU(5) € SO(10) € Eg Chain 


Each left-chiral fermion family in SU(5) is assigned to a reducible 5* + 10 representation, or 
5*+10+1 including a singlet vf. These are combined in an irreducible 16-plet when SU (5) 
is embedded in the larger rank 5 SO(10) group (Fritzsch and Minkowski, 1975; Georgi, 
1975), which requires a vz. In addition to the breaking pattern SO(10) + SU(5) x U(1), 
described in Section 10.3.1 and considered here, SO(10) has alternative breaking patterns 
into (a) flipped SU(5), in which the hypercharge generator Y includes a component of 
Qx, leading to different identifications of the particles in the 16-plet (Barr, 1982); (b) the 
Pati-Salam group. 

In SU(5) the mass matrices M? = M*?, M”, and M% are all independent. In the 
simplest extension to SO(10) the fermion masses are generated by a single Higgs field in 
the vector (10) representation, which decomposes into 5+5* under SU(5). This implies two 
Higgs SU(5) multiplets H, and Ha with two Higgs doublets, hu,a, just as in (10.284), even 
in the non-supersymmetric version. (In SU(5) a single Higgs can play both roles because the 
Yukawa couplings in (10.272) involve both H and Ht, analogous to the SM tilde couplings 
in (8.13).) There is only a single w1g¥1¢¢19 Yukawa matrix, which is symmetric in family 
indices, with the immediate consequences 


M? = M® = M°, M“ = tan 8M?, M” = My. (10.285) 


The first relation is similar to the SU(5) case (except for being symmetric), while the second 
is a consequence of Yukawa unification, i.e., the b, 7, and t Yukawas are all equal at Mx. 
In the supersymmetric case the Yukawa unification for the third family works well for large 
tan 6 ~ 40 — 50. However, both relations fail badly for the first two families. Not only do 
they give incorrect predictions such as m,,/Me = Ms/Ma = M_/My, but the second relation 
implies a trivial CKM matrix, V} = I. The last relation equates the (Dirac) neutrino masses 
to the u quark masses, which is obviously a disaster. 

A realistic spectrum therefore requires an extended Higgs sector. Introducing additional 
10-plets resolves the problem with the CKM matrix, but not the other difficulties. Similar 
to the addition of a 45 in SU(5) one can obtain a realistic spectrum by introducing very 
large 120 or 126 dimensional multiplets. The 126 also allows couplings that can generate 
Majorana masses for the vf, allowing a type I seesaw, and/or Majorana masses for vz, (the 
type II seesaw), as described in Section 9.5. For that reason, SO(10) models are frequently 
used, in connection with family symmetries, to generate models of neutrino mass, or more 
generally, models for fermion mass textures. Instead of large representations (which are 
very unlikely if there is an underlying superstring construction) one can supplement the 
WieV1e¢10 Yukawa couplings with higher-dimensional operators (Babu et al., 2000; Pati, 
2006). 

The even larger Fg (e.g., Slansky, 1981; Hewett and Rizzo, 1989), which has been sug- 
gested in some heterotic string compactifications, is rank 6 and includes the SO(10) x U(1)y 
subgroup mentioned in Section 10.3.1. It has an alternative maximal SU (3)e x SU(3)z x 
SU (3)r subgroup, referred to as trinification when a permutation symmetry on the factors 
is imposed (Achiman and Stech, 1978). The SO(10) x U(1)y embedding extends each fam- 
ily to a 27-plet, which breaks to SO(10) and SU(5) as in (10.220). The 16 contains a SM 
family, while the 10 are new exotic fermions, 


E? E? 
10 = ( = ) + ( ) +21 +Dr (10.286) 


and the 1 is a singlet Sz, similar to the vf. The 10 and 1 are quasi-chiral, i.e., vector 
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or singlet under the SM but charged under the extra U(1)’ s (Table 10.2). The Dz pz are 
exotic charge -4 quarks, while the vector pair of doublets are usually considered to be 
exotic leptons in non-supersymmetric studies. However, in the supersymmetric case they 
are often considered to be Higgs doublet superfields instead, as mentioned in Section 10.3.1. 
(The interpretation depends on what global symmetries are imposed on the superpotential.) 
The Eg model has an especially rich phenomenology because of the U(1)’ s and the exotic 
multiplets, and is often studied for that reason (e.g., King et al., 2006; Kang et al., 2008). 


Extra Dimensions and Strings 


In orbifold GUTs (Kawamura, 2001; Raby, 2009) the grand unification is present in a 
higher-dimensional space, but the gauge symmetry of the effective four-dimensional theory 
is (usually) that of the MSSM. Such constructions allow new symmetry breaking mecha- 
nisms associated with boundary conditions and background fields in the extra dimensions, 
and they may retain many of the desirable features of grand unification (such as gauge 
coupling unification, third family Yukawa relations, etc.) while avoiding some of the dif- 
ficulties (e.g., the doublet-triplet problem, too rapid proton decay, and the need for large 
Higgs representations). 

Grand unified theories do not incorporate gravity, and are therefore not as ambitious as 
superstring theories (e.g., Ibáñez and Uranga, 2012). Heterotic string theories include under- 
lying grand unification symmetries. They may compactify into an effective four-dimensional 
GUT. However, it is difficult to generate the adjoint and other large Higgs multiplets intro- 
duced in most bottom-up constructions, so the more promising versions more closely resem- 
ble orbifold GUTs (e.g., Raby, 2011). The heterotic theories may also compactify directly 
to the SM or MSSM, or to an extended version, with limited memory of the underlying 
GUT. Constructions may retain simple MSSM-type gauge unification, or the unification 
may be modified (and complicated) by the effects of new matter multiplets that survive to 
low energy and/or by the string scale gauge coupling boundary conditions, especially for 
the U(1)y (e.g., Dienes, 1997). The fermion families or the elements of the families may 
have different origins in the construction, breaking or modifying GUT Yukawa relations and 
possibly leading to family nonuniversal couplings to new U(1)’ s. 

Type IIA intersecting D-brane constructions usually do not involve a full underlying 
GUT, but they often descend to four dimensions using a Pati-Salam group (e.g., Blumen- 
hagen et al., 2005). Some versions with large enough stacks of branes do allow an SU(5) 
(or flipped SU(5)) unification, as do some Type IIB theories (Blumenhagen et al., 2009a). 
In both cases, an underlying U(5) gauge symmetry forbids a perturbative top Yukawa 
coupling, which could, however, be generated nonperturbatively by instantons. Especially 
promising are the related F-theory constructions, which allow more flexibility and can allow 
such extended groups as SU(5), SO(10), and Ee (e.g., Heckman, 2010). 


10.5 PROBLEMS 


10.1 The two-loop generalization of (5.33) for a non-chiral SU(m) gauge theory with np 
massless fermion multiplets (flavors) in the fundamental L,,, representation is 


dg? 


TE = TOP) = balm, nj)g4 + balm ny), 
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where (e.g., Dietrich and Sannino, 2007, and references therein) 


1 11 4 1 à 
bi(m, nf) = (47)? | 3 C2(G) sT] = Gon np) 
1 34 20 1 4 
bo(m, np) =— (anys [Foxe = z C2(G)Tr = 4C2(Lm)TF = — Trj nf) 
2 
— 1 
O(G)=m, Te=F,  O2(Lm) = 


We will examine the running of g(Q?) from some initial large scale Q3 down to lower scales 
as a function of m and ny. It will be useful to think of m > 2 and ny as continuous 
parameters, which can later be specialized to integer values. 

(a) For small ny both b, and by are positive, but each reverses sign for larger ny, i.e., at 
nar and np, respectively. For nf < nar the theory is asymptotically free. If nfp < nf 
as well, there is a value gẹ = (—b;/b2)!/?, known as the conformal fired point, for which 
B(g?) = 0. For some range of initial values g(Q?) will increase from g(Q?) until it reaches 
gx and will then remain ~ constant for smaller Q?. (Such a behavior is invoked in some 
models of dynamical symmetry breaking.) Calculate nar, nfp, and a, = g? /4r. 

(b) Plot g(t) vs t = In(Q/Mz) for m = 3 and some values of ny and g(to) that exhibit the 
fixed point behavior. Take Qo = Mp. 

(c) For ag = g?/47 larger than some value a, the SU (nf) x SU (np) chiral flavor symmetry 
will be spontaneously broken (analogous to QCD), and the fermions will acquire mass and 
decouple. Thus, the fixed point behavior only occurs for a, < œe, which holds for ny larger 
than some critical value ne. The region max(ne, nfp) < nf < nar is known as the conformal 
window. Calculate ne using the approximate criterion a. = 7/[3C2(Ly,)| and plot nar, Ne, 
and nfp vs m for 2 < m < 5. 


10.2 The left and right chiral projections uz, r of the u spinors in the chiral representation 
in (2.192) both reduce to the form uz, R(0,s) = Vmọs in the particle rest frame, p — 0. 
Show that the forms in (2.192) for arbitrary p = 8 ym can be obtained by acting on 
uz R(0,s) with the (active) Lorentz boost A(¢ = —ĝÊtanh™* 8), using the representation 
matrices A(1 o) and Avg 1), respectively. 

10.3 Prove (10.50) using the explicit form for Ac o) in (10.29). 

10.4 Let yım and PYəm be two Majorana fields written in 4-component notation, as in 
(10.58). Use the two-component spinor formalism developed in Section 10.2.2 to prove 


pım dom = orveml vim, 


with op = +1 for T = 1,7°, and y“y°, and op = —1 for T = y“,c”. Note that this 
rederives the relations (9.20) (symmetric mass matrix), (9.64) (antisymmetric magnetic 
transition moments), and Problem 9.4 (no vector neutral current coupling) for Majorana 
fields, derived previously using four-component techniques. 


10.5 Prove the Grassmann variable identities in (10.68). 


10.6 Prove that the Lagrangian density in (10.69) is invariant under the supersymmetry 
transformations in (10.70). Hint: the easily derived identities 


ata” +o” = 2g" T, ato” +a" oh = 2g""T 


are useful. 
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10.7 Verify (10.81). 


10.8 Derive (10.87) and (10.117). 


10.9 Show using (8.199) that the logarithmic term in (10.164) can be interpreted as the 
running of the SM quartic Higgs coupling À from a supersymmetry breaking scale Ms down 
to the electroweak scale v = O(m). Assume that all of the superpartners have masses Ms 
and that the running of \ is dominated by the hé term, with h; ~ constant. 


10.10 (a) Calculate the cross sections for e~e* —> Zh and e~e* — Ah via a virtual Z in 
the s-channel in the MSSM. Assume that one is well above the Z pole, and neglect the e+ 
masses. 

(b) You should find that 


where kzp and kan are the magnitudes of the final CM momenta for each reaction. Interpret 
this result in terms of the equivalence theorem. 


10.11 Derive (10.178). 


10.12 Consider the cascade decay 

Xs 3 e ef, č} + et x 
in the MSSM, where Xo 2 and čt are all on-shell, with (unknown) masses mi 2 and me, 
respectively. Assume that the x? is not observed and that the only kinematic information 
obtained for the event are the four-momenta p+ of the et and e~. Show (neglecting the e* 
masses) that the maximum possible invariant mass of the ete~ pair is 


Mmet — J (m3 = mz) (mz — mi) f 
ee me 


Note that the actual Mee distribution is expected to show a sharp cutoff (the kinematic 
edge) at M?:°7, allowing a good measurement of that combination of superpartner masses. 


10.13 Consider the reactions pp > qG or G, where q can be qz or gr with q = u, d,c, 
or s, and similarly for g°. Assume that all of these squarks and the gluino have the same 
mass, m, and ignore mixing between families or between L and R. 

(a) Draw, but do not calculate, the parton-level Feynman diagrams to lowest order. The 
parton-level cross section, which is the same for gz, r, 97, and qh, is given in the Cross- 
Section article in (Patrignani, 2016). 

(b) What is the largest m for which there would be at least 100 events at the 14 TeV LHC 
with an integrated luminosity of 100 fb~!? 


10.14 Suppose that the SU(2) x U(1) model with a single Higgs doublet ¢ is extended to 
SU(2) x U(1) x U(1)’, as in (10.197). Let Qg be the U(1)’ charge of ¢, and assume that 
Mz is much larger than Mz and My due to the VEV of some SM singlet field. 

(a) Calculate the Z — Z’ mixing angle 6. 


Beyond the Standard Model 505 


(b) Calculate the decay widths for Z’ — WtW- and Z’ > ZH in the limit that Mz, My, 
and My can be ignored. Hint: use the equivalence theorem with ¢ given in (8.85). 

(c) The direct calculation of the Z’ + WtW- rate (not using the equivalence theorem) 
is proportional to |0|? x (Mz/Mz-)*. Yet, the calculation in (b) indicates that it does not 
vanish for Mz/Mz > 0. Explain this apparent contradiction. 


10.15 Consider the U(1)’ extension of the MSSM, with the neutral Higgs potential given in 
(10.209). There are now six neutralinos: the four MSSM ones, the singlino $, and the U(1)! 
gaugino, Z’. (a) Write the 6x6 neutralino mass matrix corresponding to (10.184) in the weak 
basis, and analyze the spectrum in the U(1)! decoupling limit, s = V2(S) >> (Msoft, V, Mef f); 
with g2Qs,u,a fixed and comparable to g and g’. 

(b) Show that the gauge/Higgs spectrum in the decoupling limit includes the the MSSM 
particles and a degenerate massive vector supermultiplet associated with Z’, Ž F Š , and S. 


10.16 Derive (10.218) and (10.219). 


10.17 Show how the universality test in (8.224) on page 342 is modified in the SU (2)z x 
SU(2)r x U(1) model. Assume that ¢ and 6 are small, gr = gr, VE = Ve; and that the 
Vp are very heavy. What are the approximate limits on ¢ for w = 0 and for w = 7/2? 


10.18 (a) Assuming (®) takes the form in (10.269), verify the expression for v3. 
(b) Verify (10.270). Hint: use Problem 4.3. 
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APPENDIX A 


Canonical Commutation 
Rules 


In this appendix we briefly summarize the commutation rules and other aspects of standard 
field theories. 


Spin-O Fields 
Consider a spin-0 field ¢ and Lagrangian density £, as defined in Section 2.2. The conjugate 


momentum operator to ¢@ is defined as 


ôL ÔL 


n(zr) = ——, a (2) = — ; (A.1) 
60(x) do" (x) 
where db = oe and the second definition is for the case in which ¢ is complex. For a spin-0 
field, the canonical equal time commutation rules are 
t, Z), d(t,#’)] = 0, nt, Z), a(t, zZ) =0 
[o(t, 2), oE] in(t,2),m(t,2") rc 


in(t, #), d(,2")] = -i (E — 2"). 
If @ is complex, similar rules hold for ¢' and a1, while [m(t,Z), ' (t, #’)] = 0. If there are 


distinct fields Qi, then [m:(t, z), Qj (t, z')| = 183 (z — T')ôij. 
The Hamiltonian density is 


HT, 6) = rọ + rtt — Llo, Oud). (A.3) 
The Hamiltonian H(t) and momentum P(t) operators are 
H(t) = I ËZ HT, o), P(t)= / Pz [nVo +7 gi], (A.4) 
so the four-momentum operator is 
P#(t) = (H,P) = I Bz [mato +r gt — gL]. (A.5) 


(The upper case P is used to distinguish the momentum operators from the ordinary c- 
number momentum p".) If ¢ is real then the second terms in (A.3)—(A.5) should be omitted. 
H(t) and P(t) are actually independent of ¢ unless there is explicit t dependence in L. 
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The Hermitian scalar field is described in Section (2.3), with the Lagrangian density 
given in (2.20). The conjugate momentum is m(x) = 6£/6¢(a) = (x), and the Hamiltonian 
density is 


H(n, 4) = 5 [6 + (Go)? + mg] + Vo). (A.6) 


Similarly, the Lagrangian density for the complex scalar field of Section 2.4 is given in (2.88), 
implying q(x) = ¢' (x), n(x) = (a), and 


H(r, 9) = 5) (3) + (Vo!) - (Vd) + moto + Vi(d,¢"). (A.7) 


For the free Hermitian field one can use the explicit form for ¢9(x) in (2.25) to obtain 


3 => 
peat J TE p [alp (p) + at (Pa) 


2J (2n)32E, 
- ferr | MO +20] mn 
(27)32E, 2 |’ 
where 
N(B) = al @)aP) (A.9) 


is the number operator, which counts the number of particles carrying momentum p. The 
6°(0) term in (A.8) integrates to 0 for u Æ 0. For u = 0 it is the simple harmonic oscillator 
zero-point energy. It is infinite due to the integration over p, e.g., it leads to an infinite 
vacuum energy 


3 
(O|H|0) = O fate p2 + m2 = oo. (A.10) 


(The 63(0) is an artifact of our use of continuum state normalization. For box normalization 
in a volume V it would be replaced by the density of states V/(27)?.) This constant of the 
energy is unobservable when considering the microscopic interactions,! so it is customary 
to ignore it. Such terms can be systematically removed by a technique known as normal- 
ordering, which means that creation operators are always written to the left of annihilation 
operators in a product (with an appropriate minus sign for fermions). The normal ordering 
of an operator O is sometimes indicated by the symbol :O:, so that :al ag:=:a2a1:= alan, 
while :a1a2:= a a. We will always assume normal-ordering for momentum and other related 
operators, such as the currents and charges associated with internal symmetries, but will 
not display the ::. Thus, (A.8) is replaced by 


PH =P= f dp Enip a al Pp N(B A.11 
=F = (2n)92B, ” a'(p)a(p) = (27)226, ? (P). (A.11) 
Similarly, for the free complex scalar field in (2.93), 
P = | worm? [N4 (P) +N- (P)], (A.12) 
where 
N4 (B) =a'(p)a(p), N- (B) = t (P)O(B). (A.13) 


lIt would become observable when gravity is included, so more care is needed in that case. The zero-point 
energies cancel between bosons and fermions in supersymmetric theories. 
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The Noether current in (2.100) corresponds, for the free field, to the charge in (2.104), i.e 


” 
Q= | opie NO) -NP = Ne -No (A.14) 


where N,,+ are the overall number operators for 7~. 


Spin- Fields 


The Lagrangian density for a Dirac fermion in (2.152) implies that the conjugate field is 
Ta = ÔL/ő a = ipi. Fermion fields satisfy canonical equal time anticommutation rules 


{Valt, Z), Yet, £')} =U, VEE Z) oh, 2} =0 


(A.15) 
{Yalt Z), Wht, Z} = (Z -— #)bap. 
The expression for 7, implies 
H=rmdb—-L=iwt), Pra J dzvtiony, (A.16) 


which hold even in the presence of gauge and Yukawa interactions. For the free field in 
(2.159), this implies 


3 > 
> Jam or ena TEE] 


27) 326, 
(A.17) 
Brae 27) om? t [N (P, s) +N- (P, s)|- 
The Noether current for fermion ~ is 
J” = yt, (A.18) 
while the corresponding charge for the free Dirac field in (2.159) is 
2 Bp 
— —— D. = — D, = Ny — Nye. A.19 
a= J oR M9) MG] = Ne — Ny (A.19) 


Spin-1 Fields 


The Lagrangian density for the free electromagnetic field is given in (2.111). Discussion of 
conjugate variables is complicated due to gauge issues, so we will only quote the Hamilto- 
nian, 


H= 5 | ee (£? + B?) l (A.20) 


For the free field in (2.113), 


_ dp fics oo dp 2 
P! = 2 | ap’ (p, A)a(p, A) -D fame A). (A.21) 
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APPENDIX B 


Derivation of a Simple 
Feynman Diagram 


Let us sketch the derivation of the first term in (2.30), i.e., the tree-level amplitude My; = 
(p3p4|M|pp2) corresponding to the first diagram in Figure 2.3 for the Hermitian scalar field 
with K = 0. 

Our starting point is to consider a transition from an arbitrary initial state 7 to an 
arbitrary (but different) final state f with the same four-momentum. As shown in texts 
in quantum mechanics and field theory, the unitary transition matrix element Us; and 
transition amplitude My; can be calculated in perturbation theory using the interaction 
picture, to yield! 


Upi = (f|U (400, —00)|i) = (20) 454 £ Pr- Xn) Mji, (B.1) 
up to subtleties concerning disconnected diagrams. In (B.1), 
(FIU (ta, ta)é) = (FIT [eter EO 44) jay (B.2) 


is the time evolution operator, 7 is the time-ordering operator, and 
H(t) = Lilt) =~ f ËE £1(00(2)) (B.3) 


is the interaction part of the Hamiltonian in (A.4). The fields occurring in Hy; are free 
fields. For the non-gauge interactions of spin-0 particles, £; = —V;. For our example, we 
can expand the exponential and keep just the linear term in Hy (the identity term doesn’t 
contribute to the scattering amplitude), 


Up = f abe (Popilici(oo(e)|vir), (B.4) 


where 
Ago (x) 


T (B.5) 


iLr(¢o(2)) = —iVr(%o(2)) = —i 


1Some authors extract a factor of i from their definition of M. 
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Substituting the expression (2.25) for the free field ¢9(x), one finds 


AJl 27) arom 


d 
x [@e(0lasas II jane Pa + al ete] ala}]0), 


n=a 


(B.6) 


where a; = a(pı), etc. One can now move the a’s to the right and the at’s to the left. The 
nonvanishing terms in the matrix element in (B.6) are? 


4! et(PatPe—Pe—Pa)'® (O azat asal aca! aqa}l0), (B.7) 


where the 4! is due to the fact that there are 4! non-zero terms that differ only by the 
relabeling of the dummy indices a---d. Finally, one has 


aqal, = (27)?2E48° (Da — po) + aaa > (27)32E48° (Da — Po), (B.8) 


where the alaq term vanishes since a|0) = 0, and similarly for the other three terms. The 
momentum integrals can then be done using the delta functions, while the f d‘x integral 
yields (27)*64(p4 + ps — pı — p2), so that 


Upi = —id(2)*5* (p4 + ps — pı — P2), Mf; = ~id. (B.9) 


The Feynman rules for more complicated diagrams involving internal lines can be derived 
using the Wick ordering theorem. 


2The terms involving the commutators [ap, af], q = a---d, vanish for non-forward scattering. More 
generally, it can be shown that they should be ignored in such calculations. 


APPENDIX C 


Unitarity, the Partial Wave 
Expansion, and the Optical 
Theorem 


The S matrix is a unitary operator describing the evolution of an initial state. It is given 
by S = U(+oo, —oo), where the time evolution operator U(t2,t,) is defined in (B.2), i.e., 
Sfi = Upi. The transition matrix T is related by S = J + iT. It is convenient to extract a 
momentum-conserving 6 function from a matrix element, 


(f|S|é) = Spi + i(f|T li) = Spe + (27)48 (pf — pi) tT yi, (C.1) 


where the amplitude M defined in (B.1) is related by My; = iTi. The unitarity of the S 
matrix, 


(StS) = > Shp Sni = 5p, (0:3) 


n 


implies the unitarity formula (for pr = pi) 


—i (Tye — Th) = 29mT ps = Y (21)*8* (Pn — Pi) TH Tri (C.3) 


n 


Applying this to the special case f = i and using (2.51) yields the optical theorem, 
23m Tii = Aky/S Otor(k), (C.4) 


where Ctot(k) is the total (elastic plus inelastic) cross section at CM energy ys and k is 
the center of mass three-momentum defined in (2.37). Thus, the optical theorem relates the 
imaginary part of the forward elastic scattering amplitude to the total cross section. 

For elastic two-body scattering 7 is related to the traditional scattering amplitude 
f(k,0) (especially familiar in quantum mechanics) by 


T =-iM = 8rv/sf(k, 0) (C.5) 
(see, e.g., the Kinematics article in Patrignani, 2016). Thus, the CM cross section is 
do 1 
—~ = |f (k, 0)? = M/. 
oo = Ik D = aa IMI (C6) 
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The partial wave expansion for spinless or spin-averaged particles (see Weinberg, 1995, for 
the general case) is 
1 


aa 5 (22 + 1)ae(k) Pe(cos 6), (C.7) 
=0 


where P; is the #” Legendre polynomial. The partial wave amplitude is 


[ne(k)e*e(*) _ 1] 


a ; (C.8) 


ag(k) = T dcos 6 f(k, 0) Pe(cos 0) = 


pth 


where d¢ is the phase shift, and 0 < ne < 1 is the inelasticity parameter. From unitarity 


lag(k)|? < Smag(k) <1, (C.9) 


with the first inequality saturated for purely elastic scattering (ne = 1). The partial wave 
amplitudes, and their graphical representation in terms of the Argand plot, are especially 
useful for parametrizing the behavior of low energy amplitudes (where few partial waves 
are important), searching for and studying the properties of resonances, etc., as described 
in introductory and quantum texts. They are also useful for unitarity bounds on the elastic 
cross section o(k) (integrated over angle), which are useful in connection with the breakdown 
of the Fermi theory and with theoretical constraints on the Higgs mass: 


= o) -5 Fl (20 + 1)|ae(k)|*, (C.10) 
so that 4n(20 + 1) 
+ 
celk) < 2 (C.11) 


In terms of f(k, 0) the optical theorem is 


4 OO 
otot(k) = Sm f( = E NO (28+ 1)3m a(k) > o(k). (C.12) 
L=0 


The optical theorem, combined with some general considerations involving analyticity and 
crossing, can be used to derive the Froissart bound (Froissart, 1961; Pancheri and Srivastava, 
2017) on the high energy behavior of the total cross section, 

Tiot < C n? 5, (C.13) 


for s + oo, where C is a constant. 


APPENDIX D 


Two, Three, and n-Body 
Phase Space 


Techniques for evaluating 2-body phase space were considered in Section 2.3.5. Here we 
formalize that discussion and consider some techniques for the 3-body case. More general 
discussions are given in (Barger and Phillips, 1997) and in the Kinematics article in (Pa- 
trignani, 2016). 

Let us define the Lorentz invariant 2-body phase space factor 


dda d? Dp 
Ea Ey i 


dz (m7, ma, M6) = 5° (pr — Pa — Pb) (D.1) 


where py is the total four-momentum. For a decay process, m? = p? is the mass-squared of 
the decaying particle, while for a scattering process m? = p? = s. We saw in (2.55) that in 
the rest frame or CM (where pr = (mz, 0)), 


dz (m3, m2, m?) = dn?! — 2rd cos 0ft —> An PE (D.2) 
My MI MI 


where dQ = dọ dcos@ is the solid angle element of pa, and 


(D.3) 


with A(x, y, z) = £? +y? + 2? — 2xy — 2xz — 2yz. The differential cross section for 2-body 
scattering is 


d 1 
do = = {\? = zi |M]? dcos9, (D.4) 


yg? = — 
6in2p, Js | 3278 p 
where the initial momentum p; is given in (2.38). Similarly, a 2-body differential decay rate 
is 

dv = 


d 
M? = 


M|? dcos®ð. D.5 
32r? M] al | cos ( ) 


ae 


These results can be immediately generalized to chile phase space, relevant to I > 


fi- fn. Defining 
= (pr - Yrs) I Pir, (D.6) 
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the differential cross section and decay rate are, respectively, 


iS 


4 
27) cn| M|? da; dv = Ca) 
mr 


Cn|M|? dn, (D.7) 
with cn = [2(27)’)] 7”. 

The case n = 3, with 
d Pa dD d? Pe 
E, Ey Ee’ 


d3 (mi, m2, m$, m?) = 54 (pr — Pa — Pb — De) (D.8) 
is very important, especially for decay processes. We will sketch two techniques. The first 
method readily generalizes to n bodies. It involves the factorization of d3 into a product 
of 2-body factors, one of which describes b + c and the other describes a + bc, where bc 
represents the collective b + c system. The technique is especially useful when b and c are 
not observed (a simplified version was used for u decay in Section 7.2.1), when there is a 
resonance in the bc channel, or when a is the first particle emitted in a cascade. To begin, 
we introduce the factor 


(mr—ma)? 
1=| dmg, 5 (mi. — (pe + pe)”) 
(my-+me)? (D.9) 
x [pre ôt (Pec — Pb — De) © (Erc) 


in the expression for d3. We can then rearrange the expression to yield 
d’ Pa A Poc 
Ea Ere 


dp d Pe 
E b E c 


d3 = = J dm? 5* (pr — Pa — Poc) 64 (Poc — Pb — Pe) 


(D.10) 


II 


1 
5 [ami də (mi, m2, Mie) dy (mo Mg» m2) . 


Thus, d3 is just the product of b + c phase space, restricted to invariant mass Mpe, and 
phase space for a + bc, integrated over the allowed range shown in (D.9) for m?.. Of course, 
the matrix element |M|? occurs under the integrals. The separation is especially useful if 
|M]? also factorizes. It should be remembered that the dy’s are Lorentz invariants, so it is 
often convenient to evaluate the factors in different Lorentz frames, such as the I and bc 
rest frames. As a simple example of factorization, suppose there is a narrow resonance in 
the bc channel, with, e.g., 


2 f (my, ms.) 2,2 m 2 2 
|M] TE M2” + MAT? > PUT Me) TTR (mie — Mp), (D.11) 
as shown in Figure D.1. Equation (D.11) is the Breit-Wigner resonance formula for an 
unstable particle R of mass Mp and width rp = 1/rr propagating in the bc channel, as 
described in Appendix F. f(m?, m32.) describes the splitting of the initial state into a+ R 
and the decay of R into bc. The final form is the narrow resonance approximation, valid for 
Tr < Mr. In this case 


də (m?o, me, m2) 
(m2, — M2)” + M272, 
(Mh, mg, m?) . (D.12) 


cC 


1 
d| M|? =5 Jor. də (m3, m2, me) f(m, mga) x 


TF 
= də (mi, m2, Mz) f(m, M?) x MPa 
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b 
c 
a 
Ioe 
R 
Iia 


Figure D.1 Three-body final state with a resonance in the bc channel. J is the ini- 


tial decay or scattering state, g}, is the amplitude to produce aR, and gÈ is the 
resonance decay amplitude. f(m?, m2.) in (D.11) is given by |g? |?|g/|?. 


Now let us further suppose that f factorizes into terms involving the production and de- 
cay, which would be the case for a spin-0 resonance, i.e., f(m}, MZ) = |g |?|g#|?. Then, 
incorporating the relevant factors of 2 and 27 from (D.7), one finds 


o(I > abc) = o (I > aR)B(R > bc), T(I > abc) =T(I > aR)B(R- be), (D.13) 


where B(R — bc) = T(R —> bc)/TR is the branching ratio for R — bc. This reproduces 
the intuitive result that a cross section or decay rate involving an intermediate narrow (i.e., 
long-lived) state factorizes into the cross section or rate to produce the narrow state times 
the branching ratio into the final state. An interesting feature of this result is illustrated 
by assuming that R can only decay into bc, so that B = 1. In this case the overall rates 
in (D.13) are independent of the coupling strength gë. This is completely different from a 
virtual intermediate state, and is due to the fact that once R is produced it always decays. 
The same holds if there are other decay channels so long as B(R — bc) is held constant. Of 
course, the rates do depend on the production coupling |g® |. 

An alternative method is useful when |M|? only depends on pa: py, Pa Pe, and Po: De; 
such as a decay process summed and averaged over spins. In the J rest frame Pa,b,c lie in a 
plane. By assumption, |M|? does not depend on the orientation of the plane or the direction 
of Pa in the plane, so we can integrate over them. The spatial ô function can be used to 
eliminate pe. The angle between pa and pp can be expressed in terms of the energies of a 
and b using the remaining energy 6 function, so that d3 becomes an unconstrained integral 
over Ea and Ep within an allowed region. The result, derived in (Barger and Phillips, 1997), 
is 


2 2 
ds (m?, m2, m}, mZ) = 2r°m? dXadXy = T dmz.dm?.., (D.14) 
T 
where we define A 
2E; . 
X; = i hi = LA i = a,b,c, (D.15) 
mI MT 
and 2 : 
me, = (pi + pj) = (pr — Pr) =m? +m? — 2m; Ek, (D.16) 


where 2,7, and k are all different. The invariants can be expressed as 


1 
2 


Pi: pj = 5m} (1+ uk — Hi — Hj — Xk), (D.17) 
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which follows from (D.16). The energy or invariant mass variables satisfy 
Xa + Xo +X. = 2, M2, +H MŽ. + Mm? = mi +m? +m +m. (D.18) 


However, the actual boundaries for Xa,» are rather complicated (see Barger and Phillips, 
1997), so we only give them for some special cases. 


Ma,b,c =0: 
0<X, <1, 1—Xq < X) <1 


Ma E 0, Mbc =0: 


U? < XaLltta, a-B<SXy<at+B aa 


Ma, b = 0, M: #0: 
1— Xa — Me 


<X,<1- C3) 1— Xa- SAs 
0< = H H b 1=X, 


An approximation to this technique was used for pion beta decay in Section 7.2.3. The 
method is applied to calculate the leading correction to that result in Problem 7.7. 

This method is closely connected to the Dalitz plot, in which individual events observed 
in a 3-body process are plotted in the Xa — Xp or the mZ, — m2. plane. From (D.14) the 
phase space is uniform within the allowed region, so an excess of events would correspond 
to an enhancement of the matrix element. A resonance in a two-body channel would show 
up as an excess along a line parallel to the x or y axis, or along a line x + y = constant. 


APPENDIX E 


Calculation of the 
Anomalous Magnetic 
Moment of the Electron 


In this appendix we sketch the derivation of the one-loop (Schwinger) contribution of a/2r 
to ae = (ge — 2)/2, the anomalous magnetic moment of the electron, in part to illustrate 
some of the techniques for calculating Feynman integrals. The relevant contribution is from 
the diagram in Figure E.1, which will be of the form 


tegtglay" + b(pi + p2)")ur, (E.1) 
ss 


H 
Pou 


where 14, = ZI T” — 7" in the notation of Section 2.12.1. From the discussion following 
(2.359) and using the first Gordon identity in Problem 2.10, we identify ae = F2(0) ~ —2mb, 


P2 
S 
e S 
k eT enn 
5 / P2— Pı 
\/ pi-k 
Pi 


Figure E.1 One-loop vertex correction contributing to the anomalous magnetic mo- 
ment of the electron. The photon momentum is q = p2 — pı. 
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which will turn out to be non-divergent. From the Feynman rules, 


m (ie)? f d*k —i i i 
QL (27)t k?+ie (po— k}? —m?+ie (pı —k)? — m? + ie 


x [av (e-k) (h K+ m) 9” (6.2) 
_ —ie? B" (pı, p2, k) 
=a fe 


where 


BY (p1, pa, k) = Ww (p2— K +m)” (~i— +m) 7” (E.3) 

D = (k? + ie) (k? — 2pə - k + ie) (k? — 2p, -k + ie). l 
We have used pî = m? and have replaced eĝ by e? and mo by m since the differences are 
of higher order. (We set Z4 = 1 in the identification of F>(0) for the same reason.) It is 
understood that the external electrons are on-shell and that ró z is sandwiched between u 
spinors. The d*k integral appears formidable, but it can be handled by a series of tricks. 
First, one uses Feynman parametrization to combine the denominators. There are a number 
of ways to do this, but we will use the identity 


1 
sede stl — 3.2 
1 = wf dz +++ dzn0 ( Li? ) (EA) 
a102: : An 0 [do aizi] 
for any a,---a,. For example, 
1 d 
=| á M (E.5) 
a12 o [az +a(1-— z)] 
Applying (E.4), 
X p T dzıdzədz3ő (1 — > zi) _ 1 Dz (E.6) 
D o [k? — 2pə - kz2 — 2pı - kzı + ie] o [k2 — c+ ie] 


where 


Dz = dzıdzədz3ð (1 — 5 z) ; k! = k — poze — 7121 


: 7 (E.7) 
c = (p222 + p121) =m? (1— 23)" — @z122, 
with c > 0 for q? < 0. Therefore, 
_ —2ie? e Me (PLak) p, 
k2 — c+ ie’ (E.8) 
e a par atk , BY (p1, p2, k” + poz tma p, 
[k2 — c+ ied? ? 


where we have shifted the integration variable dtk —> d*+k’ in the second line. The terms 


linear in k’ integrate to zero 

d*k! k 

T) (E.9) 
[k/? — c + ie] 


Calculation of the Anomalous Magnetic Moment of the Electron 521 


The quadratic terms involving k/,k;, must be of the form, 
d+ k! k! k!, dék’? k”? 
JEER Com [ EEE man 
[k2 — c + ie]? [k2 — c + ie] 


since the 1.h.s. of (E.10) is a Lorentz tensor, and g,» is the only tensor available. Contracting 
each side with g“” implies that C = 4. Finally, using the y matrix identities in (2.174) as 
well as uz po = Muz and piur = muy, 


BY (p1, Po, k + p2z2 + piz1) =2m (pı + po)” |-22122 + 2 (1— z2) + z1 (1 — 21) 


(E.11) 
+7" terms + vanishing terms. 
Therefore, dropping the prime on k, 
—2 1— 1— D 
Fa(0) = 8im7e ff far z1z2 + z2 (1 — z2) + a (1— 21)] a (Œ.12) 
[k2 — c + ie] 


where c in (E.7) is evaluated at q? = 0, i.e., c= m? (1 — 23)”. 
Now, consider the integral 


dtk Foa Bk 
=f 7 an al dko | F =a (E.13) 
[k =C + ie] —oco [ke = k2 — C + ie] 


where c > 0. In is convergent for n > 2. The ko integral can be viewed as a contour integral 


along the real axis in the complex ko plane. The integrand has poles at ko = +( v k2 + c—ie), 
which lie in the second and fourth quadrants. One can therefore perform a Wick rotation, 
in which the contour is rotated to lie along the imaginary axis, 


+ioo +00 Bk 
Le dk Te k 7 =: f dk J en E.14 
—ioo — f2 — c] ~co = [—ka — k? — c] ( ) 


where kog = —iko is the Euclidean energy, and we have taken e — 0 since the rotated 
contour is far from the poles. This can then be written 
2 d* © kdk 
Rath if oe kp a aoe E 
[k3 + ¢]” [k2 + ¢]” (E.15) 
ee 1 1 ` 
= (—1)” in 


(n—1) (n-2) = 


where kg is a Euclidean four-vector with k?, = k2p +k?, k3.dkp = k?,dk2,/2, and f dQ4 = 

27°? is the area of a unit 3-sphere in four dimensions (Problem 1.3). The integral has poles 

for n = 1 and n = 2, indicating the divergence of the integral, but is well behaved for n > 2. 

These techniques can be readily generalized to ones such as f d*kk?? / [k? -c+ ie] g 
Putting everything together, the anomalous magnetic moment is 


1 — — — 
Fa (0) z Q f [ 22122 + 22 (1 z2) + 24 (1 21) dz,dz,dz36 (1 = y zi) 
0 


T (l= zg)" 
a 


1 
= i | 14 | dadz20 (1-3-2) = &. (E.16) 
0 zı + 29 2T 


T 
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APPENDIX F 


Breit-Wigner Resonances 


An unstable particle or resonance of mass Mr and width T = 1/7 can be treated in Feynman 
diagrams by replacing the denominator of its propagator by the Breit- Wigner resonance 
form 


1 
—— > 
q? — M2? + ic 


1 
= -M2 +iMpR’ 


D(a’) (F.1) 


so that 
1 


(q? — M2)? + MÈT? 
IfT is large, then additional energy dependent corrections may be important away from the 
peak (see, e.g., Gounaris and Sakurai, 1968). Away from the resonance peak a rate involving 
a resonance is suppressed compared to the on-shell rate by MÈI? /(q? — Mz)? « 1, and 
far enough off shell the finite width effect is irrelevant. (There may be additional energy 
dependent factors associated with phase space, etc.) 

For a narrow resonance, [/Mp < 1, it is sometimes useful to employ the narrow width 
approximation 


ID(@)? = (F.2) 


i 
Mr 
which effectively treats the production of the long-lived state as if it were stable. This was 
used in a 3-body phase space example in Appendix D, where it was shown that the produc- 
tion rate of a given final state through a narrow intermediate spin-0 resonance factorizes 
into the rate to produce the resonance, times its branching ratio into the final state. This 
result generalizes to more complicated production and decay processes and to differential 
cross sections and decay rates. One can still use (F.3) for narrow resonances with nonzero 
spin. However, the factorization into production rate times branching ratio only holds if one 
averages or sums over the external spins and integrates over all of phase space. Otherwise, 
there are spin and angular correlations that do not factorize. 

Let us consider an example of an unstable Hermitian spin-0 field ¢ that can couple to 
fermions a or b, with 


ID’)? > 5(q° — Mp), (F.3) 


L = (gatatla + gover) $, Ja,b = real. (FA) 
The amplitude for aa — bb via the s-channel ¢ resonance is 
M = i? Ja gott Uaua D(s), (F.5) 
where we take Mp = mg and T =T, in D(s). Neglecting Mma and mp, 
S 1 
|M]? = 7949 |D(8)|? Tr (B5 po) Tr (Da Ba) = Gages" |D(s), (F.6) 
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so that the cross section is 


a(s) = |m = 92% s (F.7) 
167s 16r (s — M2)? + M2r? 
R R 


Note that it is usually not useful to use the narrow resonance approximation for an s channel 
process unless one is integrating over s. Equation (F.7) can be conveniently rewritten using 


z 1 7 
Ch. T a) = ———— |Ma]? F. 
DPC aa) = Tapi eP. (F.8) 
with 7 
|Maa|? = |igal? Tr (ta Pa) = 2MRge, (F.9) 
so that the spin-summed partial width into aa is 
= g MR 
Toa = ae (F.10) 
8T 


with a similar formula for I',j. If there are no other decay channels, then F = Paa + Typ. 
From (F.7) and (F.10), 


An(s/M2)Taal yp 4T 


a 
als) = Goa) + MBI? some’ ME 


Baa Bog, (F.11) 


where Baa = Daa /T is the branching ratio into a@. This convenient form absorbs the cou- 
plings gq,» into the partial width or branching ratio factors. It also illustrates that the peak 
cross section at s = m% is independent of the coupling strengths for fixed branching ratios 
because they cancel between the partial and total widths. Similarly, for a massive vector 
resonance V,, with interactions 


L = — [Japa ha + goy vo] Vu, (F.12) 


one finds the same result as (F.11) except that 4r — 12r (Problem 2.27). 

By measuring (s) as a function of s one can determine MÊ from the position of the 
peak, T from the width of the peak, and a combination of branching ratios from the peak 
cross section peak = E( MÈ), as illustrated in Figure F.1. This was extremely powerful, for 
example, in the Z lineshape measurements at LEP. (In practice, one must apply additional 
energy dependent corrections.) In some cases, however, the detector energy resolution is not 
good enough to resolve the peak and determine I. One then effectively integrates over s to 
obtain (for the spin-0 case in (F.11)) 


_ An? _ 
[alias ~ Mp 17 Bee: (F.13) 


The expression in (F.11) is typical for an s-channel resonance (see, e.g., Weinberg, 1995), 
provided that spins are averaged (summed) and the phase space is integrated over. The over- 
all coefficient depends on the spins of the resonance and of the initial particles. Sufficiently 
near the peak, the cross section for a1a2 — b1b2 is 


4(2Sp + 1) A(s/MAV aial bats 


a(s) = Oe + DOsat) e- M2)? + MZT?’ 


(F.14) 
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O peak | 


Mk 


Figure F.1 The cross section vs. s for the Breit-Wigner resonance form in (F.11), 
illustrating the peak position, width, and peak cross section. 


where SR, Sai, and Sa, are respectively the spins of R,a1, and az (with 29 + 1 replaced 
by 2 for a photon or gluon). The prefactor is because the 25r + 1 intermediate spin states 
contribute equally to (and do not interfere in) the total spin-averaged rate, while (25,,+1)~! 
(2S,, + 1)~1 corrects for the fact that o(s) is spin-averaged while Paja, is spin-summed. 
There is an analogous correction if there is a color sum/average on a, and/or az, and a 
factor of 2 if aı and ag are identical (to correct for the statistical factor of 1/2 in the phase 
space integration in (2.81)). There may be additional s dependence away from the pole, 
due, e.g., to vertex factors or threshold effects. 
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APPENDIX G 


Implications of P, C, T, and 
G-parity for Nucleon Matrix 
Elements 


In this appendix we summarize the implications of various discrete symmetries for the 
matrix elements of vector and axial currents between on-shell nucleon or hyperon states. 
Recall that even though the weak interactions violate these symmetries, e.g., because of the 
V — A structure of the charged current, the hadronic matrix elements are dominated by 
strong interaction effects, which are believed to be invariant under P, C, and T (Section 
2.10), and approximately so under G-parity (Section 3.2.5). However, higher-order weak 
corrections could lead to small deviations from the symmetry predictions. 


Crossing Symmetry 


Consider an amplitude or matrix element involving an initial particle b(p, s), 


(a|O|6 b(p, s)) = O(P) uP, 5), (G.1) 


where a and 8 are other external particles. The spinor-valued function O(p) depends on 
D= p and on the momenta and spins of the other particles. Crossing symmetry implies that 
the corresponding matrix element for a final b° is described by the same function, i.e., 


(ab°(p, s)|O|8) = LO(p)u(B, s), (G.2) 


where in this case p = —p, and the + sign depends on how many fermion interchanges are 
involved in the crossing. In both (G.1) and (G.2), p and s refer to the physical momentum 
and spin, while p refers to the momentum flowing in the direction of the arrow in a Feynman 
diagram. Although the same function O(p) describes both processes, the physical values of 
p are in different regions, i.e., the two processes are related by analytic continuation. A 
similar relation applies to the interchange of a final particle and initial antiparticle. The 
crossing symmetry is obvious from Feynman diagrams but holds more generally, as can be 
proved using the LSZ reduction formalism. We will be concerned here with matrix elements 
of an operator O between single particle states, 


(a(pa)|O|b(po)) = Ua(Pa)O (Pa; Po) Ue (Po), (G.3) 


OU 
N 
NJ 
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where the spin labels are not displayed. Applying crossing to both a and b, 


(b° (po) |O|a*(pa)) = —Va(Pa)O(—pa, —po)vo (po) = +t (po)Oc(—Pa, —Pb)Ua(Pa), (G4) 


where O. = CO?C™! (Table 2.2) and the second form follows from (2.298). 


Vector and Axial Form Factors 


We will be considering matrix elements of the vector and axial vector currents 
Vh, = = pay p = viet ) A, = Payt po AM, (G.5) 


Analogous to (2.394), assuming only Poincaré invariance and the Gordon identities, the 
most general matrix element of V” between the corresponding single particle states is of 
the form 


iof” 
2m 
HV a5 


(a| VE (x) |b) = Ga | EKo l P) + a Foe (@?) + F al) 


10 
2m 
= Tal Ui (gus, (G.6) 


+ gial) + Ial) + I al) use’? * 


where q = pa — p and the mass m can be taken to be (Ma + mp)/2. The labels a and b on 
the states and spinors refer collectively to momentum, spin, and flavor. When we specify 
the special case a = b we mean that the flavors are the same, but the initial and final 
momenta and spins may differ. We will henceforth take x = 0. A similar expression holds 
for (a| A“, |b), with superscripts A on the form factors and matrix, FA,, gA,, T4/(q). 


Space Reflection 
As was already discussed in Section 2.10, space reflection invariance requires 
V A 
Ta (q )= YT Yala jP, Ta (4) = -PT Aala y? (G.7) 
where q, = Pau — Pp » = q”. This implies g\., = FA, = 0, up to weak interaction corrections 


(such as the anapole moment mentioned below (2. 396) on page 78). All of the results are 
summarized in Table G.1. 


Charge Conjugation 
From (2.296) the currents transform under charge conjugation as 
CVC! = -VÉ, CALC T=H=+4+At. (G.8) 
Therefore, using (G.5) and charge conjugation. 
(al Vi (2) |b) = (DIVE (@)la)* = —(0*1VE (2) a)". (G-9) 


(G.6) and the crossing relation in (G.4) then yield 


aa TYH (qu = — [ar y4- Jua) = -ü TE (q). (G.10) 


cab 
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TABLE G.1 Restrictions imposed by discrete symmetries on the form factors 
associated with vector and axial vector currents. 


Space reflection: gyp =0 F4, =0 
Charge conjugation: FY, = FY* A = g2 
g Jug £ iab iab Jiab Jiab 
A — A V V 
Fia = =F ab Jiab = —Jiab 
Charge conjugation (a = b): FY a = Il Sui =0 Piia = Gos =0 
Hermiticity (a = b): | ieee Sa PA = 4 
others real 
Time reversal: aie = FVA gé = oe : 
G-parity: POS ame FeO i =O 
p y: 3pn I2pn 1,2pn 91,3pn 


Applying similarly reasoning to A“, one obtains 


=V =A 

ra (a) = Teala) TEO = Hoal), (G.11) 
which are satisfied for real FY and g^, and for imaginary F^ and gY. As we will see, time 
reversal invariance implies real form factors, in which case both P and C independently 

imply Fe =o =0. 
In the special case of a = b, one has CV#C~! = —V# and CA¥C—! = +A, from which 

it follows that 

DEl) = Tuala) Tae = Tk: (G.12) 
(This also follow from (G.11) and the Hermiticity of V” and A, which imply that 
aO 
(gY a and FA 


3aa 


(—q).) Thus, the only C-allowed terms are FY’5,44; 93aa> Ie saa and FA 


aa 3aa* 


would have to be imaginary.) 


Time Reversal Invariance 


It follows from (2.317), (2.318), (2.325), and (2.326) that all of the form factors must be 
real (or at least relatively real for non-standard phase conventions) if time reversal holds. 
We already saw that for diagonal (a = b) transitions, Hermiticity requires g¥,,, and FA, 
to be imaginary. Thus, observation of an intrinsic electric dipole moment (g¥,,, 4 0) would 
imply the violation of both space reflection and time reversal invariance. 


G-Parity 
The G-parity transformation G = Ce’? in (3.131) is useful for the proton-neutron currents 
associated with the weak charged current. One has 


Gp WnG = +p Yn, Guppy nG! = py Yn (G 13) 
Gp) =|n°), Gn) = —|p*). 

Combining with the crossing relation in (G.4), G-parity implies (Weinberg, 1958) 
Duela) = -Tonla Tla) = rila) (G.14) 


for the weak charged current form factors. Equation (G.14) is satisfied by Fi’), and OF sens 
which are known as first class form factors, and violated by the second class ones F3on 
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and ae (The situation is reversed for the parity-violating form factors.) There was once 
considerable activity in @ decay searching for second class form factors (including some 
short-lived positive indications). Such effects, if significantly larger than those expected 
from isospin breaking, would presumably be due to second class currents with the opposite 
G-parities from those in (G.13). It is almost impossible to construct viable models involving 
second class currents within the general quark model framework, however (e.g., Langacker, 
1977). 


APPENDIX H 


Quantum Mechanical 
Analogs of Symmetry 
Breaking 


Many of the field theoretic possibilities for symmetry breaking and realization described in 
Section 3.3 have simple analogs in ordinary non-relativistic quantum mechanics. Consider 
a single particle of mass u moving in the potential 


Axt 
4 ? 


illustrated by the dashed curve in Figure H.1. (We take A > 0 in all of the examples.) The 
minimum is at x = 0, and for sufficiently small À the energy eigenvalues are approximately 
those of a simple harmonic oscillator, Ep = (n + iw. V exhibits a reflection symmetry, 
V(x) = V(—2), so the energy eigenstates Yn(x) will have either even or odd parity under 
x — —a. This will be true even when the effects of the Axt term are non-negligible. 

One can break the reflection symmetry explicitly, i.e., in the equation of motion, by 
adding a small linear term to the potential 


1 
V(a) = gu D — (H.1) 


1 Azt 
V (a) r guo a? ex + m (H.2) 


The minimum is now at 29 ~ €/uw? # 0. Expanding around zo, 


Aa — zo)t | 


: ji + O(e?). (H.3) 


V(a)~ 5H ( z0)? + Axo(x — 20)? 4 
The energy eigenvalues (in this example) are unchanged to O(e), but the reflection symmetry 
(whether around the origin or around xo) is broken. The energy eigenfunctions will no longer 
have a definite parity. 

Another possibility is to maintain the exact reflection symmetry of the potential, but to 
break it spontaneously, i.e., in the solutions to the Schrödinger equation. For 


1 4 
V(x) = — shea + m (H.4) 


(the solid curve in Figure H.1) the origin x = 0 is unstable, but there are degenerate minima 
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at +29, where zo = (pw? A). One can expand around the minimum, e.g., at +29, to 
obtain 
A(x — zo) 
4 ? 
where we have dropped an irrelevant constant V (xo) = —u?wt/4A. Assuming that the 
barrier is sufficiently high that one can neglect tunneling, we can quantize around xo. For 
small enough cubic and quartic terms, the energy eigenvalues are ~ (n + 5) V2w. Again, 
the reflection symmetry is lost. One can also combine explicit and spontaneous breaking 
by adding a perturbation —ea to (H.4) (the dotted curve in Figure H.1). This breaks the 
degeneracy between the minima at +29, and also shifts their position slightly. For « > 0 the 
minimum near +29, known as the true or global minimum, is deeper. It is straightforward to 
calculate the O(e) shift in energy eigenvalues. For small enough e€ there remains a shallower 
metastable minimum near —% 9, known as a local or false minimum. 


V(x) ~ +uw? (x — 29)? + Axo(a — zo)? 4 


(H.5) 


A 


Figure H.1 Left: one-dimensional potential in (H.1) (dashed), (H.4) (solid), or (H.4) 
with an additional —ex term (dotted). Right: two-dimensional Mexican hat poten- 
tial in (H.7), which leads to spontaneous symmetry breaking. 


Now consider a particle moving in the two-dimensional potential 
Aa? +y")? 1 oo, Art 


1 
V(z,y) = zue (ar? H y’) H 4 a gh r ṣ4 4? (H.6) 


where r? = x? + y?. The potential is independent of the polar angle 0 = tan™t y/x, and 
is therefore invariant under the SO(2) group of rotations on x and y, as well as under the 
reflections (sign changes) of x or y. The energy eigenstates can be classified by the IRREPs 
of SO(2), i.e., as eigenstates of L, with eigenvalue m = 0,+1,+2,---. For small enough à 
the energy eigenvalues are given by the harmonic oscillator formula En = (n + 1)w, where 
n = 2k + |m| with k = 0- - -oo the radial quantum number. Equivalently, n = nz + ny in 
a rectangular basis. The degeneracy of n is n + 1. Adding a perturbation ey?/2 to V (x,y) 
n (H.6) breaks the O(2) symmetry down to discrete sign changes (i.e., rotations by 7 
and sign changes in x or y). The degeneracy of the eigenvalues is broken, with En, n, = 
(nz + 5) w+ (ny + 4) w’, where w ~ w + €/(2uw). 
The Mexican hat potential, illustrated in Figure H.1, 
Aa? +y*)? 1 Arf 


1 
V (x,y) = = Spee? (z H y’) + 4 = zu t 4? (H.7) 


differs from (H.6) by the sign of the quadratic term. The origin is unstable, and there are 
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a circle of degenerate minima at radius ro = (uw?/A) \/? Similar to (H.5) one can expand 
V(r,@) around ro, 


Ar — ro)‘ 


V(r, 0) ~ +p? (r — ro)? + Aro(r — ro)? 4 1 ; (H.8) 


where we have again dropped a constant. For suitable parameter values, V(r,0) can be 
approximated by a harmonic oscillator in the radial direction, centered at rọ and with 
frequency /2w, along with a flat potential in the @ direction, leading to non-degenerate 


eigenvalues 
2 


Bete h a (H.9) 
2 2ur6 
The second term represents the zero-frequency “rolling” of the particle along the flat 6 
direction, and corresponds to quantized momentum km = m/1r9. The m = 0 mode has no 
energy in the angular variable and is analogous to the Nambu-Goldstone bosons encountered 
in Section 3.3.3. For large ro the excitations are closely spaced. 

One can add a small explicit breaking term ey?/2 to (H.7). If the corresponding frequency 
\/e/p is large compared to the excitation energy ~ (2ur2)~1 of the rolling but still small 
compared to w, the energy eigenvalues become 


1 I 
En,.no as (n, ae 5) 2 =F (no F DV (H.10) 


i.e., the Goldstone modes acquire a minimum vibrational energy. 
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meson-baryon, 106 
MSSM, 468, 470-472 
soft breaking, 464 
p decay, 232 
neutral current 
eN, 290 
ve, 280 
vN, 286 
Vee > Vee, 237 
v in matter, 395 
pion electrodynamics, 29, 136 
pion-nucleon, 103, 111 
QCD, 160 
flavor, 188 
QED, 43, 136 
spinor notation, 447 
quark, 110 
o model, 197 
standard model 
charged current, 271 
effective couplings, 309 
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366 
Lie algebra, 89-96, 128, 135 
classical, 94, 95 
exceptional, 94 
Lie group, see group 
light by light diagrams, 73 
LIGO, 430, 434 
little hierarchy problem, 428, 438, 443, 
468 
Little Higgs, 338, 428, 435, 473 
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multiverse, see cosmology 
muon (y) 
capture, 248, 249 
decay (u7 > € DeVu), 224, 232-237 
Michel parameters, 236, 255 
polarized u, 235, 256 
SPTVA analysis, 235 
inverse decay (v,e7 —> Wve), 225, 
230 
Lamb shift, 74 
HL —> ey, p —> 3e, uN — eN, 88, 366 
properties, 232 
see also magnetic dipole moment 


N 
NA48, 353 
Nambu-Goldstone boson, 119, 121-124, 
126, 141-143, 190, 338, 386, 421 
bound state, 192 
longitudinal vector mode, 124 
pseudo, 120, 122, 338, 386 
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strong interactions, 124, 172, 193, 
238 
quantum mechanics analog, 533 
Re gauge, 144-151, 347 
Nambu-Goldstone theorem, see Nambu- 
Goldstone boson 
narrow width approximation, 215, 220, 
310, 523 
naturalness, xi, 427, 429 
NZ! expansion, 187, 355 
Ne’eman, Y., 104 
neutrino 
active (ordinary, doublet), 374 
active-sterile mixing, 380, 382, 386, 
387, 407—409, 413 
anarchy models, 415 
as a probe, 373 
atmospheric, 228, 402-403 
beams, 282, 288 
counting, 385-387 
invisible Z width, 306, 309, 310, 
385 
nucleosynthesis, 386-387 
precision electroweak, 385 
decay, 385, 391, 402 
decoherence, 402, 407 
dipole moments, 390-392, 402 
Dirac mass models, 373, 375, 415-416 
extra dimensions, 416 
higher-dimensional operators, 391, 
416 
Le — Ly — L,, 383 
loop, 416 
non-holomorphic terms, 416, 464 
string instantons, 416 
Zeldovich-Konopinski-Mahmoud 
(ZKM), 383, 419, 423 
double beta decay, 376, 378, 381, 384, 
389-390, 412, 423 
alternative mechanisms, 390, 492 
BBov, BBovm, 389 
heavy v, 389, 492 
nuclear uncertainties, 389 
Earth tomography, 407 
factory, 423 
FCNC, 422 
gallium anomaly, 408, 413 
geophysical, 405 
gravitational interactions, 387 
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heavy, 385, 389 
hypothesis, 223 
leptogenesis, 339, 414, 415, 418, 433, 
499 
Majorana mass models, 373, 377, 
416-419 
higher-dimensional operators, 417 
loop, 419 
R-parity violation, 419, 475 
seesaw (Casas-Ibarra parametriza- 
tion), 420 
seesaw (double), 418 
seesaw (extended), 418 
seesaw (inverse), 419 
seesaw (lopsided), 418, 498 
seesaw (minimal), 417-418, 433 
seesaw (S'O(10)), 418, 501 
seesaw (string), 418 
seesaw (SU (2), x SU(2)r x U(1)), 
489 
seesaw (type II), 417, 501 
seesaw (type III), 418 
triplet, 416 
mass and mixing, xii, xiii, 224, 228, 
373-384, 413-422 
approximate PMNS form, 421 
BBov, 389, 412 
bimaximal, tri-bimaximal mixing, 
421 
Cabibbo haze, 421 
cosmological constraints, 388, 424 
CP violation, 382, 393, 415 
golden ratio, 421 
Jarlskog invariant, 393 
kinematic constraints, 387 
Majorana phases, 378, 381 
non-square matrix, 382 
open questions, 392, 422, 438 
PDG parametrization, 381 
PMNS matrix, 272, 381 
q — £ complementarity, 421 
Sn, An, 421 
012/00, 402, 410 
013, 393, 404—405, 410 
623 /Oatm, 402, 403, 410 
when negligible, 269, 273, 384 
mass varying, 402, 407, 409 
matter (MSW) effect, 394-396, 404 
adiabatic, 396, 402 
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level crossing, 396 
VL VS VR, 396 
resonance, 396, 402 
mixed mass models, 419 
higher-dimensional operators, 419 
mini-seesaw, 419 
mirror world, 419 
non-orthogonal, non-unitary, 394 
non-standard interactions, 402, 407 
Ve(De), 223 
vr > VR, 394 
vL — VR asymmetry, 376, 386, 409 
Va (Du); 223 
Vy, — vr interchange symmetry, 422 
v, (Dr), 223, 296 
observation, 223 
oscillations, 392-394, 396—409 
accelerator experiments, 403—404 
appearance, 392, 393 
atmospheric, 402-403, 411, 413 
dark side, 393 
disappearance, 393 
first and second class, 394 
general case, 393 
L/E dependence, 405 
long-baseline experiments, 403, 
405 
oscillation length, 392 
reactor experiments, 404—406 
short-baseline experiments, 403, 
404 
solar, 400—402, 405, 410, 413 
survival, 393 
two neutrino, 393 
vacuum, 392 
reactor anomaly, 408, 413 
reactor “bump”, 408 
relic, 390 
detection, 390 
resonant spin-flavor transitions, 391, 
402 
scattering, 225, 406, 407 
neutral current, 276, 280-283, 286- 
289 
vN, 282 
Vee — Vee , 237-238, 256, 280 
Vue —> Ve~, 280-281 
solar, 228 
alternative explanations, 402 
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astrophysical explanations, 400 
astrophysical uncertainties, 397, 
399 
CNO cycle, 399, 400 
helioseismology, 397 
LMA solution, 400, 401 
pp chain, 397 
pp, “Be, 8B, 397, 400 
spectrum, 399 
standard solar model (SSM), 397 
spectrum, 410—413 
global analysis, 410 
normal, inverted, degenerate hier- 
archies, 389, 390, 404, 405, 407, 
410 
sterile (right-handed, singlet), 258, 
309, 373, 374, 386, 404, 407—409, 
413 
keV, 388, 434, 437 
yMSM, 388 
superstring theory, 414, 416 
telescope, 406 
ultra high energy, 406-407, 423 
see also deep inelastic scattering, 
fermion mass (matrix), lepton 
mixing 
neutron, see nucleon 
neutron star, 201 
Noether current, 25, 98, 121, 128, 131, 509 
(axial) baryon number, 189 
chiral, 112 
complex scalar, 25, 30, 99 
Dirac, 102 
divergence, 99 
fermion, 43 
Hermitian scalar, 101 
SU(m), x SU(m)r, 189 
Noether theorem, 24—26, 65, 98-103 
non-covariant normalization, 7 
non-polynomial interactions, 10 
non-relativistic limit, 36, 66, 79, 86 
non-relativistic QCD (NRQCD), 187 
non-renormalizable operator, see higher- 
dimensional operator 
non-renormalizable theory, 10, 22, 224, 
276 
nonet, see symmetry: SU(3) 
normal order, 55, 82, 98, 508 
notation, 1-5 
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NOVA, 403, 411, 423 
NRQM, see quantum mechanics 
v, see electroweak interaction: scale 
nuclear 2 decay, 250-251 
Fermi transition, 251 
Gamow-Teller transition, 251 
superallowed, 251 
nuclear magneton, 78 
nucleon 
B decay (n + pe™ De), 223-224, 247- 
251, 256 
energy nonconservation, 223 
Fermi integral, 250 
inverse (e7 p + ven), 224 
v mass, 387 
weak magnetism, 248, 251 
charge radius, 74, 79 
electrodynamics, 77—80, 87 
Mp — Mn, 193, 345 
proton decay, 189, 367, 437, 475, 493, 
496 
experimental limits, 493, 500 
s quark, 186 
spin distribution, 186 
SU (2), 103 
number operator, 8, 508 
Noether charge, 25 
NuMI, 403 
NuTeV, 288, 299 


(0) 
oblique parameters, see S,T,U parame- 
ters 
octet, see symmetry: SU (3) 
off-axis beam, 404 
O(m), see symmetry 
Q, see cosmology: energy density 
Q (sss), 108, 110, 162 
w resonance, 108, 114 
OPAL, 305 
OPERA, 403, 408 
operators and matrices 
adjoint, 2 
anti-commutator, 2, 7—9, 92 
commutator, 2, 7—9, 90-92 
exponent, 90, 127 
Hermitian, 4, 90, 91 
orthogonal, 94 
special, 91 
symplectic, 94 
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trace, 2 
Dirac, 36-37 
group generators, 91 
transpose, 2 
unitary, 91, 92, 94 
optical theorem, 513 
oscillations, see kaon, B meson, neutrino 
OZI (Zweig) rule, 187, 204 


P 
P, see space reflection 
Palo Verde, 405 
parameter 
bare, 63 
physical, 64, 65 
parastatistics, 162 
Paris, 290 
parity, see space reflection 
parity violating electron scattering 
(PVES), see asymmetries: po- 
larization 
partial rate asymmetries, 60, 499 
partial wave expansion, 514 
Particle Data Group, 315 
parton distribution function, 179, 206, 
213, 221, 311 
generalized, 186 
global analysis, 182, 185, 285 
gluon, 181, 185 
hadron collider, 206, 210 
moments, 185 
momentum, 179, 181 
non-singlet, 185 
quark, 179 
singlet, 185 
splitting, 183, 185 
sum rules, 180 
parton luminosity, 213, 214, 221, 222 
parton model, 15, 161, 170, 173, 178 
deep inelastic scattering, 178-182, 
284, 287 
heavy quark decays, 253 
QCD improved, 183 
spin-0, 179 
spin-4, 179 
T decay, 252 
Pauli matrices, 4, 36, 90, 91 
Pauli spinor, see spinor: Pauli 
Pauli, W., 223 
Pauli-Dirac representation, 36, 84 
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Pauli-Villars, see regularization 
PCAC, see current 
PDF, see parton distribution function 
Peccei-Quinn mechanism, see symmetry 
penguin, see loop diagrams 
pentaquark, see hadron: exotic 
PEP, 71, 290, 293, 356 
perturbation theory, 11-13, 159 
time dependent, 87 
PETRA, 71, 162, 290, 293, 294 
phase shift, 354, 514 
phase space, 8 
2-body, 17, 20, 515 
3-body, 516-518 
energies, 517 
factorization, 516 
Kes, 256 
u decay, 233, 255 
narrow resonance, 516 
Tez, 244, 256 
cylindrical coordinates, 208 
n-body, 515 
phase transition 
chiral, 201 
deconfinement, 201 
electroweak, 337, 430, 432, 433, 477 
(85) resonance, 108, 164 
photon (A, y), see QED, electroweak in- 
teraction 
photoproduction, 186 
physical constants, 1 
PIBETA, 244 
PIENU, 240 
pile-up, 213, 217 
PINGU, 407 
pion 
as bound state, 75 
charge radius, 76 
Compton scattering, 33, 48 
decay, 224 
Te2/Ty2 ratio, 240 
Te2 (Tu2, Te2), 165, 238-241 
Tez helicity suppression, 240 
Tez, 242-244, 256 
mw — 2y, 165 
decay constant, 165, 172, 191 
chiral breaking, 192, 199, 240 
measurement, 240 
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electrodynamics, 29-34, 74-76, 85, 
136 
field conventions, 23 
phase convention, 103, 128 
pion-nucleon o term, 204 
pion-nucleon interaction, 20, 74, 78, 
103, 129, 192 
soft pions, 200 
properties, 238, 239 
pseudoscalar, 52 
self-interactions, 74, 103, 104, 129, 
136 
soft pion theorems, 122, 193, 199, 200 
SU(2), 103 
pion beta decay, see pion decay: Tes 
Planck, 387 
Planck constant, 2, 22, 67 
Planck scale, see Planck constant 
Pry, see chiral projection operators 
Poincaré invariance, see symmetry 
polar angle, 17 
polarization, 27, 179 
circular, 27 
fermion, 37, 48-49, 86 
linear, 27 
longitudinal, 28, 142, 144 
high energy, 29, 226, 327 
sum, 28, 29 
gluon, 168 
transverse, 27, 28 
pomeron, 187, 212 
potential, 10, 87, 114 
atomic parity violation, 291, 370 
boundedness, 115, 124-125 
classical, 114 
confinement (linear), 188 
constant term, 115 
Coulomb, 32, 66, 85 
effective, 68 
extremum, 115 
flat direction, 124, 481 
four-fermi, 87 
Higgs, 260 
O(4) symmetry, 262 
running coupling, 324 
Higgs (with SSB), 265 
induced cubic, 115, 116, 120, 144, 
145, 149-151, 266 
maximum, 119, 126 
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Mexican hat, 118, 532 
minimum, 114, 118, 119, 123, 125- 
127 
global, 116 
local, 116, 124, 326 
MSSM, 466 
saddlepoint, 126 
scalar exchange, 87 
SU (5), 496, 497 
supergravity, 454 
supersymmetry, 448, 449, 454, 458 
U(1)’, 481 
vector exchange, 88 
Yukawa, 5, 19, 88, 187 
potential scattering, 18-20 
propagator, 7 
Breit-Wigner, 294, 306, 523 
ghost, 147, 278 
gluon, 157 
Majorana, 384 
mass eigenstates, 81 
spin-0, 11, 24 
Re gauge, 147, 278 
spin-1, 28, 29, 66 
Re gauge, 146, 278 
spin-5, 38 
proton, see nucleon 
proton decay, see nucleon: proton decay 
pseudo-Goldstone boson, see Nambu- 
Goldstone boson 
pseudorapidity, 208 
pseudoscalar 
interactions, 106 
octet, 105, 108, 172, 195, 243 
dual interpretation, 196 
Goldstone bosons, 190 
quark model, 109 
pseudoscalar decay constant, see pion: de- 
cay constant 
pseudoscalar mixing angle, see n: n — n 
mixing 
PSI, 74, 234, 236, 244, 366 
PTOLEMY, 390 
PVDIS, 291 


Q 
QCD, see quantum chromodynamics 
QED, see quantum electrodynamics 
quantum chromodynamics, xiii, 68, 91, 
103, 157, 159-161 
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color, 157, 188 
average/sum, 166 
evidence, 162-165 
color factors, 165 
color identities, 165-166 
confinement, 158, 170, 187 
flux tube, 188 
evidence, 161-165 
history, 157 
parton processes, 165-170, 186, 202, 
210, 214 
scale parameter, 160, 171, 172 
sum rules, 187 
quantum electrodynamics, xiii, 29, 63-80, 
136 
and strong interactions, 63, 74-80, 
176 
high energy, 71 
Lagrangian, 43 
tests, 69-74 
quantum mechanics, 7, 63, 87, 89, 93, 103, 
104, 511 
measurement theory, 351 
symmetry breaking, 531-533 
quark, 75, 77, 79, 109-110, 114, 157, 159- 
161 
evidence, 161-162 
integer charge, 163, 201, 285, 493 
masses 
and pseudoscalar masses, 193 
bare, 160, 257 
constituent, 113, 130, 172, 190 
current, 113, 160, 192, 269 
deep inelastic scattering, 181 
heavy quarks, 194 
light quarks, 194 
Mu — Ma, 193 
My, = 0 and strong CP, 429 
pole, 194 
running, 194, 315, 350 
model of hadrons, 109-110, 130, 158, 
161, 196, 200 
sea, 180 
valence, 164, 180 
non-observation, 162 
quark mixing, 224 
Cabibbo rotation, 228, 273 
Cabibbo/weak/CKM universality, 
229, 341, 342, 492, 505 
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CKM matrix, 230, 253, 272, 339-343 
CP violation, 273, 433 
global analysis, 361 
Jarlskog invariant, 340, 344, 361 
PDG parametrization, 340 
Wolfenstein parametrization, 340, 
361 
in SU(2), x SU(2)r x U(1), 490 
unitarity, 227, 274, 340 
unitarity triangle, 341, 343-344 
Ved; Ves, 342 
Vids Vis, Viv, 342 
Vuo, Veb, 342 
from B decay, 254 
Ve» from vN, 286 
Vua, 341 
from 8 decay, 251 
Vus, 341 
from Ke3, 244 
see also fermion mass (matrix) 
quark-hadron duality, 159, 216 
Qw, see atomic parity violation 


R 
R field, see field: chiral 
radiative corrections 
atomic parity violation, 291 
B decay, 251, 256 
Amg, 350 
u decay, 234, 256 
finite in Fermi theory, 234 
neutral current, 304 
precision electroweak, 279, 296, 306 
box, 301 
My, 301, 315 
mixed QCD-EW, 299, 302 
mz, 301, 303 
oblique, 301, 315, 317, 338 
QCD, 302 
reduced QED, 300 
vertex, 301 
Z — bb vertex, 299, 301 
weak universality, 343 
radiative hierarchies, 427 
radiochemical detector, 399, 400 
Randall-Sundrum models, see extra di- 
mensions: warped 
rank, see group 
rapidity, 5, 208, 211, 221, 444 
rapidity gap, 212, 219, 220 
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real scalar field, see field: Hermitian 
reduced matrix element, 107, 108 
reduction formula (LSZ), 527 
reflection, see space reflection, group: 
O(m) 
Regge theory, 187, 212 
regular representation, see representation: 
adjoint 
regularization 
dimensional, 66, 163, 298, 477 
dimensional reduction, 477 
lattice, 298 
Pauli-Villars, 66, 298 
relativity, 2-3, 63 
relaxion, 429, 430 
relevant operator, 80 
remnants, xi, 438 
RENO, 405 
renormalizable gauge, 148 
renormalizable operator, 10, 80 
renormalizable theory, 10, 22, 135, 141, 
152 
spontaneously broken gauge, 144 
renormalization, xiii, 63, 70, 183 
BPHZ scheme, 298 
constant, 64 
counterterm, 298 
minimal subtraction scheme, 69, 235, 
297, 298 
on-shell scheme, 69, 297, 298 
parameter, 22, 64, 65 
QED, 63 
scale, 69, 157, 170, 215 
wave function, 22, 64, 66, 452 
renormalization group equations, 23, 68, 
170-173, 324, 477, 499 
fixed points, 371 
conformal, 502 
hadronic uncertainties, 68 
mass, Higgs, Yukawa, soft, 324, 443, 
452, 464, 498 
v mass, 421 
thresholds, 68, 171, 500 
representation, 89—91, 96 
adjoint, 91-93, 95, 97, 127, 154, 479 
anti-fundamental, 111 
(anti)symmetric, 111, 131 
complex, 93 
conjugate, 92, 93, 97 
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defining, 91, 93-95 

double-valued, 94 

equivalent, 92 

fundamental, 93, 94, 104, 110 

inequivalent, 92 

irreducible, 89, 93 

real, 93, 97 

reducible, 93 

spinor, 94 

supersymmetry, 448—449 

tensor, 94 

tilde (SU(2)), 260, 365, 374 

vector, 94, 154 
RGE, see renormalization group equa- 

tions 

RHIC, 186, 201 
p resonance, 75, 76, 108, 114, 161, 172 
p parameters 

radiative corrections, 303 

Po, 317, 337, 367 
p—w-— ọ mixing, 193, 196, 204 
Riemann Zeta function, 235 
right-handed, see helicity: positive 
Rosenbluth cross section, 79, 176 
rotation, see symmetry: rotational 
Rutherford cross section, 85 
Rutherford experiment, 161 
Rutherford Laboratory, 407 
Rg gauges, 28, 144-151 

standard model, 277-279 


S 
S matrix, see transition amplitude 
S-matrix theory, 187 
S,T,U parameters, 315, 317-319, 474 
chiral fermions, 318 
extensions, 319 
non-degenerate multiplets, 317 
SAGE, 400, 408 
Salam, A., 226 
Sanford Laborary, 404 
scattering amplitude, see transition am- 
plitude 
Schrödinger equation, 59 
Schur’s lemma, 94 
Schwinger, J., 63 
screening, 68 
seagull diagram, 30, 33, 138, 141 
seesaw, see fermion mass (mixing), neu- 
trino 
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self-energy correction, 64—67, 157 
quadratic divergence, 67 
see also radiative corrections: oblique 
shift transformation, 121, 142 
g, see Pauli matrices 
o model 
linear, 197-199, 204 
nonlinear, 199-200 
o””, see Dirac matrices 
similarity transformation, 92 
simple parton model (SPM), see parton 
model 
simplified models, 465 
single-particle state, 7 
singularity, see divergence 
sin? Ow, see weak angle 
Skyrme model, 187 
SLAC, 71, 161, 170, 186, 205, 227, 290, 
292, 293 
SLC, 71, 205, 227, 293, 296, 305, 315, 356 
SLD, 305 
SM, see standard model 
s”, see spin four-vector 
Sn, see symmetry 


SNO, 400, 401 
soft collinear effective theory (SCET), 
187, 277, 339 


soft pion theorems, see pion, 277 
soft radiation, 64, 163 
solid angle, 18 
SO(m), see symmetry 
Soudan, 402, 403 
S'p(2m), see symmetry 
space reflection, 48-54, 65, 77, 86, 104, 
105; 132, 170, 177 
antifermion, 51, 61 
axial vector, 50, 256 
Tez Keo decay, 239 
form factors, 53, 528 
pseudoscalar, 50, 52, 104 
scalar, 50 
two-component, 60, 447 
vector, 50 
violation, 49, 78, 86, 177, 224, 228, 
362 
B decay, 231 
charged current, 231 
deep inelastic scattering, 283 
uu/7 polarization, 71, 235, 256 
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neutral current, 238, 275 
NN interaction, 230 
O(a), 188 
T — 0 puzzle, 231 
sparticle, see supersymmetry: superpart- 
ners 
SPEAR, 293 
sphaleron, 432, 499 
spherical coordinates, 17 
spin and statistics theorem, 110, 162 
spin average/sum, 27, 45 
spin effects, 48-49, 86, 162, 169 
B decay, 250 
form factors, 79 
u decay, 235 
neutral current, 276 
supersymmetry, 473 
t decay, 370 
two-component, 62 
spin four-vector, 37, 50 
spinor 
chiral, 41, 84 
Dirac, 35, 37—42, 48-49 
helicity, 38, 39, 49, 84 
Pauli, 5, 38 
see also field: Weyl 
splitting functions, see parton distribu- 
tion function 
spurion, 128, 414, 476 
SSB, see symmetry breaking: sponta- 
neous 
SSM, see neutrino: solar 
standard model, xi, xii, 93, 116, 152, 257— 
279 
and neutrinos, 373 
anomaly free, 259 
problems, 425—438 
fermion, 426 
gauge symmetry, 426 
gravity, 430 
Higgs/hierarchy, 336, 418, 427- 
429, 438, 439 
parameters, 425 
strong CP, 429 
see also QCD, QED, electroweak in- 
teraction 
statistical factor, 17, 525 
step function, 5 
strange (s) quark, 109, 157 
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strangeness, 104, 229, 345 
violation, 345 
string theory, see superstring theory 
strong CP problem, see CP: strong 
strong hypercharge, 104, 109, 193 
strong interaction, 74-80, 91, 157-158 
long distance, 157-159, 170, 187-188 
short distance, 159, 170, 173 
symmetries, 91, 188-190 
breaking, 190, 193 
chiral, 112, 122, 124, 165, 172, 187- 
190, 200, 250 
discrete, 49, 65, 113, 161, 188 
flavor, 91, 103, 113, 157, 159-161, 
188-190 
gauge, 91 
strangeness, 345 
structure constants, 90, 93, 127 
structure functions, see deep inelastic 
scattering 
Sttickelberg mechanism, 29 
SU(2), see symmetry 
SU(2), x SU(2)r x U(1), 236, 261, 316, 
341, 343, 365, 390, 478, 487-492 
breaking scale, 489, 491 
discrete symmetries, 487, 491 
experimental constraints, 492 
gauge interactions, 488, 490 
Higgs sector, 488 
left-right symmetry, 236, 487, 491 
manifest /pseudo-manifest, 492 
v seesaw, 489 
SO(10), 487, 489 
W — Wp mass/mixing, 236, 251, 343, 
489, 505 
warped extra dimensions, 487 
Yukawa couplings, 490 
SU(3), see symmetry 
SU(3). x SU(2), x U(1)y, see standard 
model 
SU(3) x SU(2) x U(1), see symmetry 
SU(m), see symmetry 
super-renormalizable operator, 22, 80 
supergravity, 430, 448, 453, 458 
SuperKamiokande, 399, 402, 403, 493, 500 
supernova 
1987A, 387, 399 
diffuse v background, 388 
galactic, 388, 407 


superstring theory, xi, xii, 80, 94, 414, 416, 


426, 430, 432, 458 
and QCD, 187 
F-theory, 502 
heterotic, 485, 502 
instantons, 414, 416, 418, 502 
intersecting brane, 414, 416, 427, 502 
landscape, xi, xii, 415, 430, 431 
type IIA,B, 502 
see also neutrino: superstring theory 


supersymmetry, xi, xii, 62, 124, 125, 212, 


316, 320, 341, 365, 419, 428, 438- 
443 
algebra, 448 
auxiliary field, 449, 450 
breaking/mediation, 366, 441, 458, 
461-463 
A terms, 464 
anomaly mediation, 462, 473 
CMSSM, 464 
D-term, 461 
D-term mediation, 463 
dynamical, 462 
F-term, 461 
gauge mediation, 462 
gaugino condensate, 462 
gaugino mediation, 462 
gaugino unification, 473 
gravitino mass, 437, 462 
hidden sector, 441, 462 
higher-dimensional operator, 476 
minimal supergravity, 464, 473 
non-holomorphic terms, 464 
pMSSM, 465 
radion mediation, 463 
simplified models, 465 
soft (parameters), 441, 461, 462 
spontaneous, 461 
sum rules, 441, 461 
super-Higgs mechanism, 461 
supergravity mediation, 462 
Z’ mediation, 463, 487 
compressed spectrum, 474 
CP violation/EDM, 442, 474 
decoupling limit, 442 
dileptons, 473 
Dirac gaugino, 473, 475 
Fayet-Iliopoulos term, 458 
FCNC, 442, 462, 471, 474 
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gauge kinetic function, 458 
generators, 448 
Gp — 2, 73, 439 
Grassman variable, 449, 503 
integration, 450 
Higgs, 465-470 
doublets, 441 
heavy, 474 
interactions, 441, 468 
Ma, 466, 469 
mass, 255, 439 
maximal mixing/no mixing, 468 
precision constraints, 316 
radiative electroweak breaking, 
443, 467 
radiative mass corrections, 467, 
504 
spectrum, 467 
tan p, 441, 466 
theoretical mass bound, 441, 467, 
477, 481 
interactions 
abelian, 456—459 
Dirac fermion, 460 
non-abelian, 459—460 
superpotential, 453—456 
Kahler potential, 453 
lightest supersymmetric partner 
(LSP), 435, 442, 473 
MSSM, 151, 261, 271, 326, 337, 442, 
463-474 
decoupling limit, 337, 470, 474 
notation, 463 
superpotential, 463, 464 
Yukawa couplings, 469 
u problem, 433, 440, 442, 476—477 
Giudice-Masiero mechanism, 476 
singlet extension of MSSM, 337, 
476, 481 
next lightest sparticle (NLSP), 474 
non-renormalization theorem, 440, 
452 
O’Raifeartaigh model, 461 
parton processes, 167 
precision electroweak, 442, 474 
R-hadron, 221 
R-parity, 211, 220, 442, 475 
violation, 236, 365, 390, 475 
R-symmetry, 455, 473, 477 
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singlet extended 
U(1)’, NMSSM, nMSSM, 326, 468, 
476 
split, 431 
stealth models, 475 
superfield, 451-453 
chiral, 451-453 
D-component, 453, 457, 459 
F-component, 452, 454 
vector, 451, 453, 456 
supergauge transformation, 457 
supermultiplet 
chiral, 449 
gravity, 449 
massive vector, 459, 505 
vector, 449 
superpartners, 73, 317, 440-441 
chargino, 441, 471—474 
gaugino (G, W, B), 440, 457 
gluino, 203, 471 
Goldstino, 459, 461 
gravitino, 437 
Higgsino, 441, 476 
left-right mixing, 468, 470 
mass limits, 474—475 
neutralino, 441, 471—474 
scalar lepton (slepton), 440, 470 
scalar quark (squark), 167, 203, 
440, 470 
singlino, 505 
superpotential, 452 
superspace, 449 
transformation, 450—452 
trileptons, 473 
vacuum, 448 
charge/color/B/L breaking, 465 
metastable, 462 
Wess-Zumino gauge, 457 
Wess-Zumino model, 454 
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baryon number, 189 

lepton number/flavor, 364, 366, 
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P, C, T, CP, 188, 339, 363 
chiral, 110-113, 152 

fermion mass, 112, 122 
continuous, 89, 96, 123 
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cyclic (Zn), 90, 113, 154 
discrete, 49-60, 96, 98, 113-114 
gauge, 153 
spinor notation, 447 
family, 259, 261, 365, 385, 414, 420, 
427, 478 
field theory, 96-114 
flavor, 91, 157 
G-parity, 57, 113, 131, 248 
form factors, 529 
gauge, 25, 89, 91, 135-136 
abelian, 28, 136-137 
chiral, 136, 137 
direct product, 140-141 
gauge boson mass, 140 
non-abelian, 137—140, 159 
global, 25, 89, 135 
and gravity, 414 
internal, 89, 96 
isospin, 75, 89, 91, 97, 103-104, 113, 
128, 159, 181 
Lagrangian, 98 
Lorentz, 3, 53, 75, 89, 90, 114, 443- 
444 
violation, 60, 402 
Nambu-Goldstone realization, 118, 
122 
non-chiral, 112, 113 
Vu — Vr interchange, 422 
O(m), 94, 101, 118 
Peccei-Quinn, 429 
permutation (Sn, An), 90, 421 
Poincaré, 3, 57, 443-444 
quaternion, 90 
rotational, 50, 54, 89, 91, 98, 103, 
104, 444 
SO(m), 94, 127, 154 
Sp(2m), 94 
space-time, 89, 96 
SU(2), 91-93, 103, 128 
multiplication rule, 127 
SU(3), 91-93, 103-110, 159, 162, 251 
nonet, 105, 108, 109, 196-197 
octet, 104, 129 
SU(3) x SU(2) x U(1), 93 
SU(6), 110, 130 
SU(m), 93-96, 99, 110, 131, 171 
SU(m), x SU(m)pr, 112, 131, 189- 
194 


634 


and weak current, 241 
translation, 3, 25, 53, 89, 114, 444 
U(1), 23, 24, 27, 29, 91, 92, 99, 117, 

121. 136, 171 
as rotation, 102, 118 
as subgroup, 137 
monopoles/strings, 117 
U(m), 91, 94, 99, 130 
U-spin, 109, 130 
V-spin, 109 
vacuum, 97 
weak isospin, 91 
Wigner-Weyl realization, 118, 122 
chiral, 190, 198 
see also QED, QCD, electroweak 
interaction, U(1)’, SU(2)z x 
SU(2)r x U(1), grand unifica- 
tion, C, P, T, CP, CPT 
symmetry breaking, 114-124 
charge non-conservation, 131 
chiral, 112, 120 
fermion mass, 120, 122, 192, 199, 
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non-analytic, 247 
electromagnetism, 104, 108 
explicit, 98-100, 113-115, 117, 122 

Nambu-Goldstone, 120 

Wigner-Weyl, 118 
isospin, 73, 103, 104, 108, 114, 193- 

194 
in nuclei, 193 
leakage from multiplet, 99, 196, 247 
quantum mechanics, 531-533 
quark masses, 103, 104, 108, 113, 160, 
190-194 

soft, 118 

spontaneous, xii, 29, 80, 98, 114, 115, 
118, 122-124, 131 

chiral, 120, 165, 172 

discrete, 117 

Higgs alternatives, 255, 264, 327, 

337-339, 428 
Higgs mechanism, xii, 29, 122, 124, 
141-144, 226, 261-263, 323 
polar basis, 121, 122, 126, 131, 
143 

rectangular basis, 120, 131 
SU(2) x U(1), 257 
SU(3), 107-109, 193-194, 196-197 
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T 
T, see time reversal 
T, see time reversal: Dirac matrix 
’t Hooft-Feynman gauge, 148, 347 
T matrix, see transition amplitude 
T2K, 403, 405, 411, 423 
tadpole diagram, 22 
tadpole operator, 10, 116 
TASSO, 162 
T, see Pauli matrices 
tau (T) 
decay, 223, 224, 252 
Michel parameters, 252 
v, mass, 253 
T —> Tvr, 256 
universality, 253 
properties, 252 
SU(2) x U(1) assignment, 295 
technically natural parameter, 497 
technicolor/extended technicolor, 319, 
337 
tensor notation, 96, 110, 160 
tensor operator, 88 
tetraquark, see hadron: exotic 
Tevatron, 205, 216, 217, 227, 290, 303, 
305, 311, 313, 323, 328, 332, 356, 
486 
TEXONO, 280 
textures, 117, 420-422 
Oakes relation, 424 
O(x), see step function 
0 vacuum, see tunneling: instantons 
0e (Cabibbo angle), see quark mixing 
Jacp, see CP: strong 
Thomson cross section, 48 
’t Hooft, G., 226 
(3*, 3) + (3, 3*) model, 190-194 
three-vector, see vectors 
tilde trick, 261, 440, 488 
time evolution operator, 511, 513 
time reversal, 50, 58-60, 236, 242, 339 
antiunitary, 59 
Dirac matrix, 59 
form factors, 529 
quantum mechanics, 58 
two-component, 61, 447 
violation, 78, 220, 339, 361-364 
entanglement, 361 
K? oscillations, 361 
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pseudo, 362 
triple correlations, 362 
see also electric dipole moment 
time-ordered product, 11, 38 
time-varying coupling constants, 434 
Tomonaga, S., 63 
top (t) quark, 157, 255, 296 
decays, 217, 255, 315 
discovery, 313 
mass prediction, 227, 296, 313, 315 
radiative electroweak breaking, 443, 
467 
top-color, 337 
topological defects, 117 
trace identities, 36-37 
transition amplitude, 11, 15, 87, 121, 148, 
161, 362, 511, 513 
translation invariance, see symmetry: 
translation 
transverse variables, 208, 209, 211-212, 
215 
triangle diagram, see anomalies 
TRISTAN, 71, 290, 293 
TRIUMF, 236, 240 
Troitsk, 387 
tunneling 
instantons 
B + L, 189, 432 
U(1)a, 189, 195 
metastable vacuum, 116, 326 
phase transition, 433 
Twin-Higgs, 428 
TWIST, 236 
two-component notation, 41, 60-62, 86, 
374 
dotted/undotted indices, 445 
raising/lowering indices, 445 
spinor, 445—448 


U 
U(1)a problem, see axial baryon number 
U(1), see symmetry 
U(1)’, 220, 261, 280, 297, 314-316, 341, 
415, 477, 479—487 

collider, 486 

dark photon, 73, 479, 487 

diagnostics, 486 

Es models, 484 

exotic fermions, 487 

FCNC, 487 
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gauge interactions, 479 
hidden sector couplings, 483, 487 
Higgs/neutralino sector, 487, 505 
intermediate scale, 481 
kinetic mixing, 482 
fractional electric charge, 482, 487 
left-right models, 483 
leptophobic, 482 
minimal unification models, 485 
motivations, 479 
u problem, 481, 485 
neutral current /LEP 2, 485 
v mass, 487 
sequential, 483 
Z-pole, 486 
Z — Z' mass/mixing, 480 
UAL, 265 
UA2, 265 
ultraviolet completion, 420 
ultraviolet cutoff, see divergence: ultravi- 
olet 
U(m), see symmetry 
underlying events, 213, 217 
uniqueness, Xi 
unit sphere (surface area), 5, 521 
unitarity, 161, 513-514 
S-wave, 225, 327 
violation 
Born approximation, 225 
Fermi theory, 225 
Higgs, 326 
intermediate vector boson theory, 
226 
unitary gauge, 29, 143, 148 
multiscalar interaction, 148, 155 
standard model, 263-276 
units, 1, 206, 207 
energy, 1 
unpolarized rate, see spin average/sum 
up (u) quark, 109, 157 
YT (bb), 164, 187, 188, 203, 255, 356 


V 
V — A, see current 
vacuum, 7 
vacuum condensate, see composite opera- 
tor 
vacuum expectation value, 10, 114, 118, 
141 


vacuum polarization, see self-energy 
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vacuum polarization functions, 318 

vacuum saturation approximation, 349 

vacuum stability, 124, 325, 335, 418 

vector boson fusion, 219-220, 328 

vector fermion, 112 

vector field, see field: gauge 

vector meson dominance, 186 

vector mesons, 108 

vector potential, 26 

vector symmetry, see symmetry: non- 
chiral 

vectors, 2—3 

Veltman, M., 226 

Veneziano (dual resonance), 187 

vertex correction, 64, 66, 77, 519 

VEV, see vacuum expectation value 


W 
W boson, 225, 302-303 
and color, 164 
decay widths, 304-305 
discovery, 227, 265 
helicity, 322 
mass, 264, 265, 313 
propagator effects, 314 
W+W- — WtW, 219, 226, 327 
strong coupling, 327 
see also electroweak interaction: 
charged current 
Ward-Takahashi identity, 65, 66 
weak angle, 72, 298 
atomic parity violation, 291 
defined, 264 
global analysis, 296, 315 
Møller scattering, 292 
Vue, 280 
Vu N, 288 
renormalized, 298-300 
effective (52), 298, 307 
MS (sinz) 298 
on-shell ( s#,), 298 
Z-mass (s4,,), 298 
running, 300 
weak charged current (WCC), see elec- 
troweak interaction 
weak equivalence principle, 387, 402 
weak hypercharge, 257, 258 
weak interaction, see electroweak interac- 
tion 
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weak neutral current (WNC), see elec- 
troweak interaction 

weight diagram, 105, 108, 110, 229 

Weinberg, S., 226 

Weyl spinor, see field: Weyl 

Wick ordering theorem, 512 

Wick rotation, 348, 439, 521 

Wigner-Eckart theorem, 104, 107, 128 

SU(3) octets, 107 

WIMP, see cosmology: dark matter 

WMAP, 387 

Wr, see SU(2)7 x SU(2)r x U(1) 

Wu, C.S., 231 


Y 
Yang, C.-N., 231 
Young tableaux, 95 
Yukawa interaction, 49, 52, 103, 128, 146, 
159 
standard model, 260, 270 

diagonal for mass eigenstates, 271 

running coupling, 324 

Yukawa matrices, 260 
Yukawa potential, see potential: Yukawa 
Yukawa, H., 225 


Z 
Z boson 
decays, 86, 306 
discovery, 227, 265 
mass, 264, 265, 298, 302-303 
measurement, 308 
predicted, 226 
propagator effects, 293 
Z-pole, 305-313 
asymmetries, 306, 309-313, 370 
effective couplings, 299 
future experiments, 320 
invisible width, 309 
lineshape, 306-309, 524 
measurements, 276, 308 
T polarization, 311 
widths, 306-309 
see also electroweak interaction: neu- 
tral current, precision tests, 
asymmetry 
Z burst, 390 
zero-point energy, 508 
ZFITTER, 298 
Zn, see Symmetry 
Z' boson, see U(1)’ 


